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1. Geometrical vectors and vector fields 

2. Calculating with the div, grad and curl 

3. Index notation, unit vectors and coordinate systems 

4. Maxwell's equations and the Lorentz Force 

5. Gauge transformations and a particle in an electromagnetic field 

6. The Laplace equation and the method of images 

7. Separation of variables and Legendre polynomials 

8. Electric and magnetic multipole moments 
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19. Revision of special relativity 

20. Covariant Maxwell's equations and the transformation of fields 
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1 Vector Calculus and Field Theories 

Electrodjmamics is a theory o^ fields, and all matter enters the theory in the form o^ densi- 
ties. All modern physical theories are field theories, from general relativity to the quantum 
fields in the Standard Model and string theory. 

You will be familiar with particle theories such as classical mechanics, where the fun- 
damental object is characterised by a position vector and a momentum vector. Ignoring 
the possible internal structure of the particles, they have six degrees of freedom (three 
position and three momentum components). Fields, on the other hand, are characterised 
by an infinite number of degrees of freedom. Let's look at some examples: 

A vibrating string: Every point x along the string has a displacement y, which is a de- 
gree of freedom. Since there are an infinite number of points along the string, the 
displacement y{x) is a field. The argument x denotes a location on a line, so we call 
the field one-dimensional. 

Landscape altitude: With every point on a surface {x,y), we can associate a number 
that denotes the altitude h. The altitude h{x,y) is a two-dimensional field. Since the 
altitude is a scalar, we call this a scalar field. 

Temperature in a volume: At every point {x,y,z) in the volume we can measure the 
temperature T, which gives rise to the three-dimensional scalar field T(x,y,z). 

In this course we are interested in vector fields. In the remainder of this chapter we will 
construct the theory of vector calculus from the various different notions of a vector. 

1.1 Vectors 

What is a vector? We often say that a vector is a geometrical object with a magnitude and 
direction, and we represent this graphically using an arrow. However, the arrow is not the 
only geometrical object with a magnitude and direction, as Fig. 1 shows. The stack has a 
direction (the grey arrow) and a magnitude given by the density of sheets. The magnitude 
of the thumbtack is given by the area of the flat surface, and the magnitude of the sheaf is 
given by the density of the lines. Often we do want to use arrows to represent vectors, but 
other times the stack, thumbtack or sheaf are more natural pictures of what is going on. 




arrow stack thumbtack sheaf 



Figure 1: Geometrical objects with a magnitude and direction. 
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Figure 2: A capacitor with different distances between the two charges plates. 



As an example, consider the capacitor C shown in Fig. 2a. The charge Q on the plates 
produces an electric field inside the capacitor. The direction of the vector d gives the 
orientation of the capacitor, and its magnitude the plate separation. Similarly, the vector 
E denotes the electric field inside the capacitor, which has both a magnitude and direction. 

When we increase the distance between the capacitor plates from d to d' while keep- 
ing the potential V constant, the electric field inside the capacitor changes from E to E'. 
Graphically, this is represented in Fig. 2b as a longer vector d' and a shorter vector E'. 
Therefore, the arrow is a natural pictorial representation of the plate separation (because 
it grows along with the separation). However, the vector E shrinks as d grows (since the 
electric field decreases), and the stack is a more natural representation of the vector E. 
As the plates are pulled apart, the sheets of the stack are pulled apart also, lowering the 
sheet density of the stack, and hence lowering the magnitude of the vector, as required. 




Figure 3: The inner product between vectors. 



Next, consider the inner product (also known as dot product or scalar product) between 
the vectors d and E. From the physics of capacitors, we know that this is a scalar (namely 
the potential difference V"), regardless of the separation between the plates: 

E-d = E'-d' = y = ^ = constant. (1.1) 

This has a straightforward meaning when we consider the arrows and stacks. The value 
of the inner product is equal to the number of times the arrow pierces the sheets in the 
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stack. When there is an angle 6 between the directions of the arrow and the stack, the 
number of pierced sheets is reduced by a factor cosd, exactly as you would expect. We 
therefore always have to pair up an arrow with a stack in an inner product (see Fig. 3a). 

We can also take the inner product between a thumbtack and a sheaf In this case the 
value of the inner product is given by the number of times the sheaf pierces the surface 
of the thumbtack (see Fig. 3b). Again, this value is invariant under continuous squeezing 
and stretching of space. 

We can also give a geometrical meaning to the cross product. Consider the cross prod- 
uct between two arrows, shown in Fig 4. The two arrows very naturally form the sides 
of a parallelogram with area ab sind, which coincides with the magnitude of the vector 
c = a X b. The natural graphical representation of a cross product is therefore the thumb- 
tack, since the direction of the thumbtack is automatically perpendicular to the plane of 
the parallelogram. Can you construct the cross product of two stacks? 




Figure 4: The cross product between vectors a and b. 



In these lecture notes we denote vectors by boldface characters and vector components 
by regular italicised characters (as usual). Sometimes it is also instructive to indicate ex- 
actly what kind of geometrical object our vector is. In those cases we follow the convention 
that arrow vectors have an arrow over the s5rmbol and sheafs have a double arrow. Stacks 
have a horizontal line under the vector symbol and thumbtacks have a circle. We write 

AxB = C and DxE = F. (1.2) 

o — — 

The embellishments are not strictly necessary, and most of the time we'll leave them out; 
in practice we can always deduce the type of vector from the context. There is a special 
nomenclature for the arrow, stack, thumbtack and sheaf that predates these geometri- 
cal pictures, and we will use this towards the end of the course when we discuss special 
relativity: 

arrow = contravariant vector 
stack = covariant vector 
thumbtack = covariant vector capacity 

sheaf = contravariant vector density 

The covariant and contravariant vectors are the most important, since they play a central 
role in relativity. 

Exercise 1.1: Show that (axb)-c is geometrically a volume (a capacity). What is (axb)-c? 
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Figure 5: A vector and its components in different coordinate systems. 



In addition to multiplying arrows with stacks we multiply capacities with densities to 
obtain scalars. You are already familiar with this in the context of mass and charge den- 
sities: you multiply the mass density of a body with its volume (a capacity) to get its mass 
(a scalar). Similarly, you multiply the charge density of a line charge with the length of 
the line (the capacity) to get the total charge (the scalar). 

This pictorial approach to vector calculus is treated in detail in a little hook called Geomet- 
rical Vectors by Gabriel Weinreich (Chicago Lectures in Physics, 1998), which would make 
excellent supplementary reading material. 



It is important to distinguish between a vector and its components. The arrow vector v is 
an object that is completely independent of any coordinate system, as shown in Fig. 5a (it 
has a length and a direction). We can decompose v into components x and y relative to 
some coordinate system spanned by the orthonormal basis vectors and e^: 



V = 



with 



(o) 



and 



(1.3) 



as shown in Fig. 5b. (You probably know these basis better as = i and Cy = j.) Alter- 
natively, we can decompose the same vector v into components x' and y' relative to some 
other coordinate system spanned by the vectors Bx' and By/: 

X = x' + y' Cy' = xex + yCy . (1.4) 



The vector v does not depend on the coordinate system, only its components do; v has the 
same length and points in the same direction in all three instances in Fig. 5. 



1 .2 A geometrical representation of vector fields 

Now that we are familiar with a whole family of geometrical vectors, we can construct 
vector fields. A vector field has a geometrical vector of some sort at every point in space. 
But before we go into more detail we first define what we mean by a scalar field $. 
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scalar field with equipotential planes stacks form the gradient field 

Figure 6: The scalar field. 

If the scalar field is continuous, we can think of it as a collection of equipotential surfaces, 
as shown in Fig. 6. Each point in space lies on exactly one of the equipotential surfaces, 
which determines the value of the field at that point. 

Next, we can take a small "melon bailer" and scoop out a little three-dimensional 
sphere in the neighbourhood of such a point. You can see in Fig. 6 that this creates a 
small stack, with the direction given by the steepest ascent of the equipotential surfaces. 
We can repeat this procedure around every point in space, giving a stack vector at every 
point in space. This is of course exactly the gradient of the scalar field, which we can now 
see is a stack vector field F. We therefore have 

F = gradO. (1.5) 

In physics we often add a minus sign (which is just a redefinition of F), because "stuff flows 
downhill". When we follow a path through the stack vector field F from point 1 to point 2, 
the line integral along the path depends only on the end points: 

2 

F-dr = 02-3'i. (1-6) 

One possible physical interpretation of this integral is the work done moving a particle 
through the vector field from point 1 to point 2. If the vector field can be written as the 
gradient of a potential, then the work done depends only on the end points. 

Next, consider a series of stack vectors as shown in Fig. 7a. If we place these stacks in 
the neighbourhood of each point in space, we create a stack vector field A. However, as you 
can see in Fig. 7b, they will not line up neatly into continuous equipotential planes. This 
is therefore not a gradient field. Rather, the discontinuities form a bundle of lines that 
remind us strongly of a sheaf vector. Indeed, the stack vector field that we have created 
has a vector property that is graphically represented by these sheafs. The simplest vector 
property of a vector field (apart from the vectors at each point in space, which we already 
represented by the stacks) is the curl of the vector field. The sheaf vectors in Fig. 7b are a 
graphical representation of the curl: 

J = curlA. (1.7) 
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the discontinuities form a sheaf vector vield 
Figure 7: The stack vector field. 



To gain some intuition that this is indeed the curl, imagine a loop around one of the lines 
in the sheaf. On the left, the loop will pierce fewer sheets of the stack than on the right. 
Since the density of the sheets is the magnitude of the vector, this means that the vector 
on the right will be larger than on the left. If the vectors describe a force, there will be a 
torque on the loop, causing a rotation. This is exactly the property of a vector field that is 
measured by the curl. 

Now that we know that the sheaf vectors form the curl of the field A, we can immedi- 
ately derive an important theorem. When A is a gradient field (A = grad$), there are no 
discontinuities (see Fig. 6), and therefore the curl of A vanishes. Therefore 

curl(gradO) = 0, (1.8) 

for any scalar field O. It is kind of obvious from the pictures, but exact why this is as good 
a mathematical proof as any other requires some deep thinking on your part! 

Another theorem that becomes almost trivial to prove is Stokes' theorem. Imagine that 
we create a closed loop in a stack vector field, as shown in Fig. 8. Let the loop be called 
C and the surface that the loop encompasses S. We can define the line integral around 
the loop as §qA- dr, which is just the number of discontinuities enclosed by C. We can 
calculate this differently by integrating the density of the discontinuities over the surface 
S. But this is nothing more than the surface integral over curl A. We then arrive at 

i A-dr= [ curlA-dS, (1.9) 

JC~ Js - o 



where the surface is naturally defined as a thumbtack. This is Stokes' theorem. You are 
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the number of discontinuities in a loop C is the sheaf density times the surface S 

Figure 8: Stokes' theorem. 



familiar with this theorem in the form of Ampere's law, in which the magnetic field along 
a closed loop is proportional to the current through the surface spanned by the loop. 

When we create the stack vector field A, there is no beginning or end to the discontinu- 
ities that make up the sheaf J . This means that for a closed surface every line that goes 
in must come out, and the net effect is zero: 



curlA-dS = 0. (1.10) 

s 

It is important to note that this is true for the curl of A, but not necessary for A itself For 
example, we can create a vector field H not from stacks, but from sheafs instead. When 
we do this, we will again encounter the possibility that line densities of the sheafs in 
neighbouring points do not match. In this case we have sheaf discontinuities, as shown in 
Fig. 9. 



A sheaf field with discontinuities 

Figure 9: Sheaf discontinuities. The stars are end points and the dots are starting points. 

We can call the density of these discontinuities p, and define this as the divergence of H: 

p = divH. (1.11) 
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This is a scalar quantity, and behaves just as you would expect from the divergence, 
namely as the origin of field lines. 

Since we have just seen that the curl of a vector field A does not have any sheaf dis- 
continuities, we immediately obtain the well-known theorem 

div(curlA) = 0. (1.12) 

Again, the proof is implicit in the geometrical representation. 




The net number of lines out of the surface 
is given by the discontinuities in the volume 



Figure 10: Gauss' theorem. 



Finally, we can prove Gauss' theorem for vector fields. Consider the sheaf vector field 
H and the volume V shown in Fig. 10. The surface of the volume is denoted by S. The 
net number of lines coming out of the surface S must be identical to the net number of 
discontinuities inside the volume V (where start points are positive and end points are 
negative). Integrating over these two densities, we obtain 

U-dS= f divHdV. (1.13) 

° Jv 

This is Gauss' theorem. You are familiar with this theorem in electrostatics, where the 
flux of the electric field lines out of a closed surface is proportional to the total charge 
encapsulated by the surface. 

Later in the course we will derive Maxwell's equations in macroscopic materials, in 
which the fields E, B, D and H play a crucial role. In geometric vector notation, these 
equations read 

curlE = 

dt 

=> 

divD = p 

dD ^ 
curlH = + J 

- dt 

divB = 0, (1.14) 
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with the constitutive relations D = eE and B = /iH. Note how the Maxwell equations re- 
spect the geometric aspects of the vectors (e.g., where the curl of a stack is a sheaf), but 
the constitutive relations do not. This is because these are material properties that break 
the scale invariance of the geometric vectors. 

1.3 Div, grad and curl 

The geometrical representation is nice, and it gave us some powerful theorems, but in 
practice we still need to do actual calculations. In this section we will develop the mathe- 
matical tools you need to do these calculations. 




Figure 11: An altitude and temperature map with "equipotential" lines. 

First, consider the scalar field. The interesting aspect of such a field is how the values 
of the field change when we move to neighbouring points in space, and in what direction 
this change is maximal. For example, in the altitude field (with constant gravity) this 
change determines how a ball would roll on the surface, and for the temperature field it 
determines how the heat flows. Note that in both these examples there are "equipotential 
planes" of constant altitude (where the "plane" is a line) and temperature (see Fig. 11). 
When we take our "melon bailer" and carve out stack vectors in these maps, we obtain 
the steepest ascent in the altitude map, and the direction of heat flow in the temperature 
map. Previously we called this the gradient. 

Let the scalar field be denoted by fix,y, z). Then the change in the x direction (denoted 
by i) is given by 

fix + h,y,z)- fix,y,z)i dfix,y,z)i 

lim 1= 1. (1.15) 

h^o h dx 

Similar expressions hold for the change in the y and z direction, and in general the spatial 
change of a scalar field is given by 

-J-i + J-3 + J-\^^Vf^ grad f, (1.16) 
ox oy oz 
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which we identified with the gradient of f. The "nabla" or "del" symbol V is a differential 
operator, and it is also a (stack) vector: 



.a .a .3 

V= i— +j— + k— . 

- dx dy dz 

Clearly, this makes a stack vector field out of a scalar field. 



(1.17) 



Next, consider the curl of a vector field. In the previous section we saw that we need to 
construct a closed loop and see how the vectors change when we go around exactly once (see 
Fig. 12). Let the (stack) vector field be given by A with components Ax{x,y,z), Ay{x,y,z) 
and Az{x,y,z). Note that each component A^, Ay and A^ of the vector field A generally 
depends on all three directions x, y and z. It is absolutely crucial that you appreciate this, 
because most errors in calculations tend to originate from the false belief that Ax depends 
only on x, Ay depends only on y, etc. 

Consider the four-part infinitesimal closed loop in Fig. 12, starting at point {x-l,y -I), 
going to {x + l,y - 1), ix + l,y + l) and via (x-l,y + 1) back to (x-l,y- 1), as shown in figure 
12. The area of the square is 41^. The accumulated change of the vector field around this 
loop is given by the projection of A along the line elements. We evaluate all four sides of 
the infinitesimal loop in Fig. 12. For example, the line element that stretches from x-l to 
x + l at y-l is oriented in the x direction, so 1 directed along this part is 



f2l\ 


loj 



(1.18) 



with a minus sign because the line segment points in the negative x-direction. We take 
the inner product of 1 with the stack vector A at the point ix,y- l,z): 



A= [Axix,y-l,z) , Ayix,y-l,z) , Azix,y -l,z)) . 



(1.19) 



y+l 



y-l 



A^{x,y+I,z) 



Ay{x-I,y,z) , 


+ 




/\x(x,y-/,z) 



Ay{x+I,y,z) 



x-l X x+l 



Figure 12: Calculating the curl of a vector field A. 
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Note that all components of A are evaluated at the same point. The inner product then 
becomes 



A.-1—[Ax Ay A2] ■ 



f-2l\ 





= -2lAx(x,y-l,z). 



(1.20) 



This is a scalar, but it changes from point to point in space. 

We repeat the same procedure for the other three line elements. Make sure you get 
the orientation right, because the top horizontal part is directed in the +x direction. The 
corresponding term gets a plus sign. Now add them all up and get A • 1 around the loop. 
We take the limit of Z — 0, so we replace every I with dl. We can then write for the 
infinitesimal loop 



^ A-dl = 2dl [Axix,y- dl,z) + Ayix + dl,y,z)- Axix,y + dl,z)-Ayix-dl,y,z)] . 

We can divide both sides by the area 4dl^, which gives 

'Axix,y - dl,z)-Axix,y + dl,z) Ay{x - dl,y,z)-Ay{x + dl,y,z)^ 



(1.21) 



A-d\ = 



2dl 



2dl 



(1.22) 



According to Stokes' theorem the left-hand side is proportional to the sheaf vector field 
curl A in the direction perpendicular to the surface enclosed by the loop. When we take 
the limit of ^ the vector field curIA becomes almost constant, and the surface area 
factorizes. At the same time, the right-hand side turns into partial derivatives: 



dl^ls 



4cZZ2j/-'A-^S=4rfZ2 



1 .0 (dAy dA. 

curlA-k(2dZ)2 = (curlA)2 = ' ^ 



dx dy 



(1.23) 



where (curl A)^ means the z component of curlA. 

We have made a loop in two dimensions (the xy plane), while our space is made of three 
dimensions. We can therefore make two more loops, in the xz and yz planes, respectively. 
The quantities §^^X-d\ and §y^X-d\ then form the y and x components of the sheaf vector 
field J = curl A: 



J = 



dA^ M, 



dy dz 



1 + 



dAr M, 



. dz 



dx 



j + 



'dAy dAx 
, dx dy 



(1.24) 



Every Ax, Ay and value is taken at the same point in space {x,y,z), which leads to 
the value of J at that point {x,y,z). Previously, we defined this relationship as J = curl A, 
so now we have an expression of this relationship in terms of differential operators. In 
compact matrix notation, the curl can be written as a determinant 



J= VxA= 



1 J 

dx d. 
Ax 



k 

dz 



Ay A2 



(1.25) 
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Sometimes, when there are a lot of partial derivatives in an expression or derivation, it 
saves ink and space to write the derivative didx as d^, etc., with lower indices. Both 5^ 
and Ax etc., are covariant vectors (stacks). We want to make a distinction in our notation 
of the components when the vectors are contravariant (the arrow and the sheaf), and we 
shall write these with upper indices. 



A^{x,y+I,z) 




Figure 13: Calculating the divergence of a vector field A. 

Finally, we consider the divergence of a vector field. Consider the volume element V - (2Z)^ 
at point r - xi + yj + 2k and a vector field A{x,y,z), as shown in figure 13. We assume 
again that I is very small. From Gauss' theorem, the divergence is given by the number of 
discontinuities in the sheaf vector field inside the volume, which is directly related to the 
flux through the surface. All we need to do to find the divergence is calculate the net flux 
through the surface (and divide by the infinitesimal surface area). 

To calculate the flux in the x direction we subtract the incoming and outgoing flux at 
the two surfaces of the cube that are perpendicular to the x direction. The incoming flux is 
given by the inner product of A at point (x-l,y,z) with the area S = 4l^i. For the outgoing 
flux we have the same inner product, but evaluated at point (x + l,y, z). The net flux in the 
X direction is therefore 

[ A-dS = Al'^\A{x + l,y,z)-\-A{x-l,y,z)-\] 

= [A% + Z,3/,2) - A% -l,y,z)] , (1.26) 

where we have written the components of A with upper indices since A is a sheaf vector 
field, and therefore contravariant. Similarly, the flux in the y and z direction is given by 



f A- dS - 4/2 [A^{x,y + l,z)-A^x,y-l,z)] . 

JSy ° 



(1.27) 
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and 



A-dS = 4r [A'ix,y,z + l)-A'ix,y,z - 1)] . (1.28) 

The total flux is the sum over these. Using Gauss' theorem, we relate this to the divergence 
over the volume: 

A^(x + l,y,z)- A^(x + l,y,z) 



iifL 



divAdy 



{21Y Jv 21 

Ayix,y + l,z)-Ayix,y-l,z) 



+ 



21 



A^{x,y,z + l)-A^{x,y,z-l) 
+ - . (1.29) 

In the limit Z ^ 0, the right-hand side becomes a sum over derivatives and the left hand 

side is evaluated at the infinitesimal volume around a single point with constant A in an 

infinitesimal volume (2Z)^. We can therefore write 

,. . dAy , , , 

divA= + + = V-A, (1.30) 

dx dy dz 

which relates the divergence of a vector field to a differential operator that we can use in 
practical calculations. 

1 .4 Second derivatives 

In physics, many properties depend on second derivatives. The most important example 
is Newton's second XacwF = ma, where a is the acceleration, or the second derivative of the 
position of a particle. It is therefore likely that we are going to encounter the second 
derivatives of fields as well. In fact, we are going to encounter them a lot! So what 
combinations can we make with the gradient, the divergence, and the curl? 

The div and the curl act only on vectors, while the grad acts only on scalars. Moreover, 
the div produces a scalar, while the grad and the curl produce vectors. If /" is a scalar field 
and A is a vector field, you can convince yourself that the five possible combinations are 

V-(V/") = divgrad/", 
V(V-A) = graddivA, 
Vx(V/") = curlgrad/" = 0, 
V-(VxA) = divcurlA=0, 

Vx(VxA) = curlcurlA. (1.31) 

Of these, the first (V-(V/")) defines a new operator V-V = called the Laplacian, which we 
can also apply to individual components of a vector field. The three non-zero derivatives 
are related by the vector identity 

Vx(VxA) = V(V-A)-V2A, (1.32) 

which we will use very often. 
Exercise 1.2: Prove equation (1.32). 
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1.5 Helmholtz' theorem 

The curl and the divergence are in some sense complementary: the divergence measures 
the rate of change of a field along the direction of the field, while the curl measures the 
behaviour of the transverse field. If both the curl and the divergence of a vector field A are 
known, and we also fix the boundary conditions, then this determines A uniquely 

It seems extraordinary that you can determine a field (which has after all an infinite 
number of degrees of freedom) with only a couple of equations, so let's prove it. Suppose 
that we have two vector fields A and B with identical curls and divergences, and the same 
boundary conditions (for example, the field is zero at infinity). We will show that a third 
field C = A - B must be zero, leading to A = B. First of all, we observe that 

V xC = VxA-VxB = 

V-C = V-A-V-B = 0, (1.33) 

so C has zero curl and divergence. We can use a vector identity of equation (1.32) to show 
that the second derivative of C is also zero: 

v2C = V(V-C)-Vx(Vx = 0-0 = 0. (1.34) 

This also means that the Laplacian of every independent component of C is zero. Since 
the boundary conditions for A and B are the same, the boundary conditions for C must be 
zero. So with all that, can C be anything other than zero? Eq. (1.34) does not permit any 
local minima or maxima, and it must be zero at the boundary. Therefore, it has to be zero 
inside the boundary as well. This proves that C = 0, or A = B. Therefore, by determining 
the divergence and curl, the vector field is completely fixed, up to boundary conditions. 
This result is known as Helmholtz' theorem. 

Looking ahead at the next section, you now know why there are four Maxwell's equa- 
tions: two divergences and two curls for the electric and magnetic fields (plus their time 
derivatives). These four equations and the boundary conditions completely determine the 
fields, as they should. 

1 .6 Index notation 

Consider again the vector v from equation (1.3), but now in three dimensions: 

x = xCx + yCy + zez ■ (1.35) 

Since the values x, y, and z are components of the contravariant (arrow) vector v, we can 
write them as v^, v^, and with upper indices. That way we have baked into the notation 
that we are talking about the components of v. The vector can then be written as 

v = i;^ex + i^^e3, + i;^e2. (1.36) 

Note that this is effectively an inner product, and the basis vectors e^, and are 
covariant vectors, or stacks. We also know that an inner product gives us something that 
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is invariant under coordinate transformations. Indeed, it gives us the vector v. If you're 
confused that an inner product yields a vector, remember that the vectorial character is 
carried by the basis vectors e^, Gy and 62 inside the inner product. 

We can now introduce an enormously useful trick that will save us a lot of time. Notice 
that there is some redundancy in the notation of equation (1.36). We repeat, in a sum- 
mation, the component multiplied by the corresponding basis vector. So we can write this 
straight away as a sum: 

Since we typically know (from the context) what our coordinate system is, we may refer to 
V in terms of its components v-' . This is called index notation, which we will now develop 
for more general vector operations. 

Next, suppose that v = grad /" is a gradient field, so there is a vector v at every point in 
space. We know that gradient fields are covariant (stack) vector fields, so the components 
Vj carry lower indices. We can write 



df df df 

Vx = — Vy = — and Vz = —. (1.38) 
ox ay oz 



We can abbreviate the derivatives as d^, dy, and dz (with lower indices to match the com- 
ponents of V, since f does not carry any indices), which allows us to write 

Vj = djf. (1.39) 

The next operator we consider is the divergence. We know that the divergence operates 
on a contravariant vector field (arrows or sheafs) with upper indices. This matches the del 
operator that, as we have seen in equation (1.39) carries lower indices. We therefore have 
the inner product 

dAy dA^ 

divA=V-A= + + . (1.40) 

dx dy dz 

We can write the divergence as a sum over only one coordinate derivative: 

j=x,y,z 

So far, nothing new. However, we want to save writing even more. Notice that on the 
left-hand side of Eq. (1.41) there are no fs (the divergence gives a scalar, which does not 
have any components), and on the right-hand side there are two fs. Apparently, repeated 
indices (the fs) imply that we need to sum over them! This allows us to be even more 
efficient, and also drop the summation symbol: 

V-A=a,A-^', (1.42) 



This is called Einstein's summation convention: repeated indices are summed over (this is 
also called contraction of indices). A proper inner product always contracts a lower index 
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with an upper index, because it multiplies a covariant with a contravariant vector. We will 
not be rigorous with the placement of indices until the last chapter on relativity, where it 
becomes important. 

Notice that the difference between a scalar and a vector is indicated by the number of 
indices: /" is a scalar, and Bj = djf is a vector (or rather the j^^ component of a vector). We 
can extend this to other objects with multiple indices Fjk and rjjki, etc. These are called 
tensors. For example, we can write a (random) mathematical equation 



Fjk = djAk +Bj{dkf) + dirijk' 



(1.43) 



Notice that each index on the left is matched exactly on the right, and repeated indices 
in the same term are summed over. If you were to write this out in long-hand you would 
notice that F has nine components, and rj has 27 components. Index notation is not just 
about being lazy, it's about being practical! 

The final vector operation we translate to index notation is the curl. Suppose that the 
curl of a covariant vector field A is written as 



VX A: 



I dy dz 



dAr dA; 



. dz 



dx 



(dA^ dA 



, dx dy 



= [dyA^ - d^Ay) e"" + (d^A^ - d^A^)ey + [d^Ay - dyA^) 



(1.44) 



The curl of a covariant vector field is a contravariant field (a sheaf), so the e-' carry upper 
indices. We will explore later how we get from lower to upper indices. 

There is a pleasing symmetry (or asymmetry) in equation (1.44), which we want to 
exploit. First of all, notice that there are no terms in which any of the components of e\ 
dj, and Ak have identical indices. Second, if any two indices i, j, or k in e', dj, and Ak are 
interchanged, the term changes sign (check this for yourself!). Since the curl of a vector 
field gives another vector, we can write component i of V x A as (V x A)'. We then need 
to match indices, which requires a tensor of rank three (that means three indices; two to 
contract with the indices of dj and Ak, and one for the component i). The most general 
choice possible is 

iVxAy = eijkdjAk, (1.45) 
where Cijk is the Levi-Civita symbol^. We have already established that 

Ciik = Ciki = eiik = and Cijk = -ejik = -Ckji = Ckij , (1.46) 

and so on. The non-zero components off are either +1 or -1. A particularly useful identity 
is 



£ijk£klm = SilSjm-SimSjl, (1.47) 

^Notice that we don't bother with upper and lower indices for Cijk- This is because the Levi-Civita symbol 
does not transform hke normal vectors, and our geometrical picture breaks down here. 
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Figure 14: Unit vectors in cartesian and polar coordinates. 



where da is the Kronecker delta: 




ifi^j 

1 ifi=j. 



(1.48) 



You can always do an immediate check on vector equations with indices, because the un- 
paired indices on the left must match the unpaired indices on the right. The upper and 
lower indices much match also. When you do a calculation this is a quick sanity check: If 
the indices do not match you know something is wrong! 

Exercise 1.3: Show that div(curl A) = using index notation. 
Exercise 1.4: Prove equation (1.47). 

1 .7 Unit vectors and coordinate systems 

Finally, we briefly revisit the concept of unit vectors and coordinate systems. Consider a 
vector field in two dimensions (see Fig. 14). At any point in space, we have a vector with 
a magnitude and a direction, here a black arrow. We can express the vector components 
in cartesian coordinates, such as in Figs. 14a and 14b. In that case, for every point in the 
plane the (grey) cartesian unit vectors x and y point in the same direction, and they are 
perpendicular to each other. 

On the other hand, if we express the vector in polar coordinates as in Figs. 14c and 
14d, the (grey) polar unit vectors r and 4> point in different directions at different points! 
The unit vector r always points radially outward, away from the origin, while (p always 
points tangential along the great circle through the point that is centred at the origin. At 
any point r and (p are perpendicular. You should think of these unit vectors as a local 
coordinate system at the point (x,y) in the two-dimensional space. 

Exercise 1.5: Express the unit vectors r and 4> in terms of x and y. 
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2 Maxwell's Equations and the Lorentz Force 

In this section we review the laws of electrodjTiamics as you have learned them previously, 
and write them in the form of Maxwell's equations. We start with electrostatics and mag- 
netostatics, and then we include general time-dependent phenomena. We also introduce 
the scalar and vector potential. 

The theory of electrodynamics is mathematically quite involved, and it will get very 
technical at times. It is therefore important to know when we are being mathematically 
rigorous, and when we are just putting equations together to fit the observed phenomena. 
First, we postulate the laws. In fact, it was Coulomb, Biot and Savart, Gauss, Faraday, etc., 
who did measurements and formulated their observations in mathematical form. There is 
nothing rigorous about that (although the experiments were amazingly accomplished for 
the time). However, when Maxwell put all the laws together in a consistent mathematical 
framework the rules of the game changed: In order to find out what are the consequences 
of these postulated laws, we have to be mathematically rigorous, and see if our mathemat- 
ical predictions are borne out in experiment. 



i 



2.1 Electrostatic forces and potentials 

We start our journey to Maxwell's equations with the electrostatic force. You know from 
PHY1 01 that Gauss' law relates the flux of the electric field lines through a closed surface 
to the charge density inside the surface: 

E.dS=^^, (2.1) 

where Qinside is the total (net) charge enclosed by the surface, and eo = 8.85 . . . x 10"^^ F m~^ 
is the electric permittivity of free space. Next, we use Gauss' theorem, which relates the 
flux of a vector field through a closed surface to the divergence of that vector field over the 
volume enclosed by the surface: 

(fE dS= f V-Edr. (2.2) 

JS Jv 

In addition, we can write the total charge inside the volume V as the volume integral over 
the charge density p: 

Qinside= / pdr, (2.3) 
Jv 

We therefore have 

f V-Edr= [ —dr, (2.4) 
Jv Jv £o 

Since this equation must hold for any volume V the integrands of both sides must be equal. 
This leads to our first Maxwell equation: 



V.E = ^ 



(2.5) 
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It determines the divergence of E. The physical meaning of this equation is that electric 
charges are the source of electric fields. The constant of proportionality £"o fixes the size of 
the electric field produced by a unit amount of charge. 

You have also encountered the fact that a (static) electric field can be expressed as 
the gradient of a scalar function $, called the scalar potential (previously called V). This 
implies that the curl of E is zero: 

E = -VO ^ VxE = 0, (2.6) 

since curl(gradO) = for any $. Combining Eqs. (2.5) and (2.6), we obtain the Poisson 
equation 

v2o = -— , (2.7) 

which, in vacuum (p = 0) becomes the Laplace equation 

V2o = 0. (2.8) 

The zero curl of E, and hence the Poisson and Laplace equations are valid only in electro- 
statics. We will see later how fields that are changing over time modify these equations. 

2.2 Magnetostatic forces 

We have seen that the divergence of E is given by the electric charge density. It is an ex- 
perimental fact (at least at low energies in everyday situations) that there are no magnetic 
charges; only magnetic dipoles (such as electron spins and bar magnets) and higher seem 
to exist. This immediately allows us to postulate the next Maxwell equation: 



V-B = , (2.9) 



which is sometimes called Gauss' law for magnetism. The physical meaning of this equa- 
tion is therefore that there are no magnetic monopoles. Perhaps we will find magnetic 
monopoles in the future^, in which case this Maxwell equation needs to be modified. 

For the last magnetostatic equation (the one for curIB) we first consider Ampere's law, 
where the magnetic field component added along a closed loop is proportional to the total 
current flowing through the surface defined by the loop: 

^B- dl = jUo/enclosed, (2.10) 

where [J-q - 4n x 10"^ H m~^ is the magnetic permeability of free space. You are familiar 
with this law from PHY1 02, where you used it to calculate the magnetic field due to an 
infinite current carrying wire, and similar problems. 



^According to theoretical arguments in quantum field theory, the existence of even a single magnetic 
monopole in the universe would explain why charges come in discrete values ofe- 1.602 176565 x 10"^^ C. 
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We can apply Stokes' theorem to Eq. (2.10), which allows us to write 

{ B dl= [ (VxB)- dS, 

Jc Js 



(2.11) 



where S is any surface enclosed by the loop C. At the same time, we can write the enclosed 
current /enclosed in terms of the current density J as 



(2.12) 



(2.13) 



/enclosed - / <J ' dS. 

Js 

Combining these equations we can write 

f (VxB-juoJ)- dS = 0. 
JS 

Since this must be true for any surface S, the integrand of the integral must be zero: 

VxB = ^oJ. (2.14) 
This is Ampere's law, and valid only in magnetostatics. 



2.3 Faraday's law 

Now, let's consider general electromagnetic fields that may vary in time. Lenz' law states 
that the electromotive force ^ on a closed conducting loop C is related to the change of the 
magnetic flux through the loop shown in figure 15: 



<g=<j)^E-dl 



dt ' 



(2.15) 



with 



$5= f Bir,t)-dS. 

Js 



(2.16) 



ii a 



C^— 



Figure 15: Lenz' law. 
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Do not confuse Ob with the scalar potential O: they are two completely different things! 
We can use Stokes' theorem to write Eq. (2.15) as 

dBir,t) 

IS 



£ E-dl= f (VxE)-dS = - f 



■dS. 



(2.17) 



ic Js Js dt 

Since this must be true for any surface S we can equate the integrands of the two integrals 
over S, and this gives us Maxwell's equation for the curl of E: 



dB 

VxE = 

dt 



(2.18) 



This is commonly known as Faraday's law. The physical meaning of this law is that a 
time-varying magnetic field generates vortices in the electric field (e.g., the discontinuities 
in figure 7). Our previous result of V x E = is true only when the magnetic field does not 
change over time (magnetostatics). 



2.4 Charge conservation 

Before we move to the last Maxwell equation, it is convenient to consider the mathematical 
expression of charge conservation. Charges inside a volume V bounded by the surface S 
can be a function of time: 

Qit)= [ pir,t), (2.19) 
Jv 

and the current flowing out of the volume V" bounded by the surface S is given by 



lit) 



= i J-dS= [ V-Jdr, 

JS Jv 



(2.20) 



where J is the current density. Conservation of charge then tells us that an increase of the 
charge in the volume V (a positive time derivative) must be balanced by a flow of charge 
into V through the surface S (a negative lit)). The difference between these quantities 
must be zero, or 

dQit) 



dt 



+ lit) = 0. 



When we substitute Eqs. (2.19) and (2.20) into Eq. (2.21), we find 

dp 



[ —dr+ [ V-Jdr = 0. 
Jv dt Jv 



(2.21) 



(2.22) 



When we write this as a single integral over V, this equality can be satisfied only when 
the integrand is zero: 

dp 



r dp 

Jv dt 



+ V-Jdr = 0, 



and we find the general expression for charge conservation 



dp 
dt 



+ V-J = 0. 



(2.23) 



(2.24) 



As far as we know today, charge conservation is strictly true in Nature. 
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2.5 The Maxwell-Ampere law 

The fourth, and last, Maxwell equation is a modification of Ampere's law V x B = /ioJ. As 
it turns out. Ampere's law, as stated in Eq. (2.14) was wrong! Or at least, it does not have 
general applicability. To see this, let's take the divergence of Eq. (2.14): 

V-J= — V-(VxB) = 0, (2.25) 

since the divergence of a curl is always zero. But charge conservation requires that 

VJ=-^, (2.26) 
dt 

So Ampere's law in Eq. (2.14) is valid only for static charge distributions (hence "electro- 
statics"). We can fix this by adding the relevant term to Ampere's law. 

1 dp 
V-J= — V-(VxB)-— . (2.27) 
Mo dt 

We use Gauss' law of Eq. (2.5) to write the charge density as a divergence p = e"oV-E, which 
yields 



VJ = V' 



/ 1 dE\ 

— (VxB)-eo^ ■ (2.28) 
VMo dt j 



The final Maxwell equation (the Maxwell-Ampere law) therefore has an extra term due to 
a time-varying electric field: 



dE 

VxB-/ioco— = MoJ 
dt 



(2.29) 



The physical meaning of this is that closed magnetic field lines are created by currents 
and electric fields changing over time. 

The complete set of (microscopic) Maxwell equations is 

dB 

V-B = and VxE+ — = (2.30) 

dt 

p dE 

V-E = — and VxB-uoeo — = uoJ. (2.31) 

Co dt 

The top two equations are the homogeneous Maxwell equations (they are equal to zero), 
and the bottom two are the inhomogeneous Maxwell equations (they are equal to a charge 
or current density). For every Maxwell equation we have the behaviour of the fields on 
the left hand side, and the source terms on the right hand side. This is generally how field 
equations are written. This way, it is clear how different source terms affect the fields. 

Exercise 2.1: Show that charge conservation is contained in Maxwell's equations by 
taking the divergence of V x B and substituting V • E. 
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2.6 The Lorentz force 

The Maxwell equations tell us how the electric and magnetic fields are generated by 
charges and currents, and how they interact with each other (i.e., their dynamics). How- 
ever, we still have to relate Maxwell's equations for the electromagnetic field to Newton's 
second law, F = ma. In other words, how are the fields related to the force on a particle? 
A particle of charge q and velocity v experiences the Lorentz force 

F = gE + 9vxB, (2.32) 

and is sometimes used to define what we mean by E and B in the first place. In modern 
physics we say that E and B exist as physical objects in their own right, regardless of the 
presence of test charges q, or an aether, for that matter. The Lorentz force is necessary to 
complete the dynamical description of charged particles in an electromagnetic field. 

Consider a zero B field, and the electric field is created by a point particle with charge 
Q at the origin. We can go back to Gauss' law in integral form in Eq. (2.1) and say that 
the flux of electric field through a spherical surface centred around the charge Q is pro- 
portional to the charge: 



L 



EdS = — . (2.33) 

s eo 



By sjrmmetry, the E field points in the radial direction r, or E = Er with E the magnitude 
of E. This is aligned perfectly with the infinitesimal surface elements cZS, or r- dS - dS 
(since r has length 1). Moreover, the magnitude E is constant over the surface, also by 
symmetry. The integral therefore becomes 



E [ dS = —. (2.34) 
Js eo 



The radius of the sphere is |r|, where r is a point on the surface of the sphere. We can 
therefore evaluate the integral as the surface area of the sphere: 

E [ dS = 4:n\rfE = —. (2.35) 



Is eo 

Putting back the direction of the E field as r, we find that the electric field of a point charge 
at the origin is given by 

E = — ^-^r. (2.36) 
47reo|r|^ 

Substituting this into the Lorentz force with zero B field yields 

F= . ^ , „ r, (2.37) 

which you recognise as the Coulomb force. The force on the charge q at position r points 
away from the charge Q (in the direction r), which means it is a repulsive force. When q 
and Q have opposite signs F gains a minus sign and becomes attractive. 
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3 Scalar and Vector Potentials 

You are familiar with the concept of a potential difference between two plates of a capaci- 
tor, or a potential produced by a battery. You also know that such a potential produces an 
electric field that is a gradient of this potential. In this section, we extend the concept of 
a potential not only to this familiar one (which we will denote by $), but also to a vector 
potential A. 



3.1 The vector potential 

Since the electric field is the gradient of a scalar field (the scalar potential O), it is natural 
to ask whether the magnetic field is also some function of a potential. Indeed, this is pos- 
sible, but rather than a scalar potential, we need a vector potential to make the magnetic 
field: 



B = VxA (3.1) 

The proof is immediate: 

V-B = V-(Vx A) = 0, (3.2) 

since the divergence of a curl is always zero. We can consider this the definition of the 
vector potential A. At this point, the vector potential A, as well as the scalar potential O, 
are strictly mathematical constructs. The physical fields are E and B. However, you can 
see immediately that whereas E and B have a total of six components {Ex, Ey, Ez, B^, By, 
and Bz), the vector and scalar potentials have only four independent components (O, Ax, 
Ay, and A^). This means that we can get a more economical description of electrodynamics 
phenomena using the scalar and vector potential. 

Before we embark on this more compact description, we note that the electric field is 
also affected by the vector potential A. The Maxwell equation 

dB 

VxE = (3.3) 

dt 

implies that E can't be a pure gradient, or we would have V x E = 0. Instead, we use this 
equation and substitute Eq. (3.1) to give us 



dB d 
VxE+ — = VxE+— VxA = or Vx 
dt dt 



dA\ 

E + — =0. (3.4) 

dt I 



Therefore E + A is a gradient field: 



dA dA 

E+ — = -VO or E = VO. (3.5) 

dt dt 

This is the proper expression of the electric field E in terms of potentials. When A does 
not depend on time, this reduces to the electrostatic case where the electric field is simply 
the (negative) gradient of the familiar scalar potential. 
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We should re-derive Poisson's equation (2.7) from V - E = p/eo using this form of E: 



o d p 
v2(D + _(v.A) = - — 
dt Co 



Similarly, we can write 

V X B = V X (V X A) = V(V • A) - V^A = ^qJ + hqCq— -VO 

dt\ dt I 

We can rewrite the last equality as 



V^A-)Uoeo 



— |-V(V-A) = -poJ 



(3.6) 



(3.7) 



(3.8) 



Eqs. (3.6) and (3.8) are Maxwell's equations in terms of the scalar and vector potential, 
and amount to two second-order differential equations in O and A. This looks a lot worse 
than the original Maxwell equations! Can we simplify these equations so that they look a 
bit less complicated? The answer is yes, and involves so-called gauge transformations. 



3.2 Gauge transformations 

As we have seen, the electric and magnetic fields can be written as derivative functions of 
a scalar potential O and a vector potential A. You are familiar with the fact that adding a 
constant to a function will not change the derivative of that function. The same is true for 
the electric and magnetic fields: they will not change if we add constants to the potentials 
(you already know this about the scalar potential). However, since we are dealing with 
three-dimensional derivatives, it is not just adding constants to the potentials that do not 
change the fields; we can do a bit more! 

Recall that B = V x A. Since V x (VA) = for all possible scalar functions A, we see that 
we can always add a gradient field VA to the vector potential. However, we established 
that the electric field E depends on the time derivative of the vector potential. If E is to 
remain invariant under these changes of the potentials, we need to add the time derivative 
of A to the scalar potential O: 

dA dVA dA „ 

E' = -V(D + V — =E, (3.9) 

dt dt dt 

since VidfA) = ^^(VA). It is clear that the full transformation is 

aA(r,^) 

0(r,^) - (D'(r,^)-0(r,^) , 

dt 

Air,t) ^ A'(r,^) = A(r,^) + VA(r,i). (3.10) 



This is called a gauge transformation. Let's give a formal definition: 
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Definition: Agauge transformation A(r, t) is a transformation (a change) of the potentials 
that leave the fields derived from these potentials unchanged. 

On the one hand, you may think that it is rather inelegant to have non-physical degrees 
of freedom, namelythe function A(r, t), because it indicates some kind of redundancy in 
the theory However, it turns out that this gauge freedom is extremely useful, because it 
allows us to simplify our equations, just by choosing the right gauge A. 

One possible choice is to set V • A = 0, which is called the Coulomb gauge (or radiation 
gauge). We will see why this is a very useful choice when we discuss electromagnetic 
waves. Another useful gauge is the Lorenz gauge, in which we set 

a<i) 

V-A + ;Uoco— = 0. (3.11) 
This leads to the following Maxwell equations: 



V^-iUoeo^ )$ = --, (3.12) 
V^-l^oeo-^jA=-^oJ. (3.13) 

Now you see that Maxwell's equations have simplified a tremendous amount! Gauge 
transformations are important in field theories, particularly in modern quantum field 
theories. The differential operator in brackets in Eqs. (3.12) and (3.13) is called the 
d'Alembertian, after Jean le Rond d'Alembert (1717-1783), and is sometimes denoted by 
the symbol □. This makes for a super-compact expression of Maxwell's equations: 

□ = -— and □A=-jUoJ. (3.14) 
£"0 



P 



The differential equations (3.12) and (3.13) are called d'Alembert equations. 

Exercise 3.1: Show that the Lorenz gauge leads to Maxwell's equations of the form 
given in Eqs. (3.12) and (3.13). 



3.3 A particle in an electromagnetic field 

Often we want to find the equations of motion for a particle in an electromagnetic field, 
as given by the scalar and vector potentials. There are several ways of doing this, one of 
which involves the Hamiltonian H(r,p), where r and p are the position and momentum of 
the particle, respectively. You know the Hamiltonian from quantum mechanics, where it 
is the energy operator, and it is used in the Schrodinger equation to find the dynamics of 
the wave function of a quantum particle. However, the Hamiltonian was first constructed 
for classical mechanics as a regular function of position and momentum, where it also 
completely determines the dynamics of a classical particle. The idea behind this is that 
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the force on an object is the spatial derivative of some potential function. Once the Hamil- 
tonian is known, the equations of motion become 

dn dH dpi dH 

—— = - — and —— = -—. (3.15) 
dt opi dt ori 

These are called Hamilton's equations, and it is clear that the second equation relates the 
force (namely the change of momentum) to a spatial derivative of the Hamiltonian. 

The derivation of the Hamiltonian of a particle with mass m and charge q in an elec- 
tromagnetic field is beyond the scope of this course, so we will just state it here: 

H(r,p) = [p - qMr, t)f + q <D(r, t) , (3.16) 
2m 

where r is the position and p the momentum of the particle. In a homework exercise you 
will be asked to show that this very general Hamiltonian leads directly to the Lorentz 
force on the particle. The Hamiltonian can be further simplified for the specific problem 
at hand by choosing the most suitable gauge. In addition, the field is often weak enough 
that we can set « 0. In this case the Hamiltonian becomes that of a free particle with a 
coupling term ^p-A, also known as dipole coupling. 

As a simple example, consider a particle with mass m and charge g in a field with 
scalar potential = and vector potential A = Aocos(kz - (ot)-k. What are the equations 
of motion if the particle is at rest at time ^ = 0? Since A has only and a; -component, we can 
write the Hamiltonian as 

H= —ip-p-2qp-A + A-A) + qO 

2m 

= — [pI+ p^ + pI - 2qpxAocosikx - (ot) + q'^Alcos'^ikz - a)t)] . (3.17) 

the first set of Hamilton's equations then become 

dx dH Px qAo ^, , dy dH Py dz dH pz 

x = — = - — = cosKkx - (ot) ; y-—r- — = — 5 = ^ = — = — > 

dt dpx mm dt dpy m dt dpz m 

and the second set become 

■ = = =0 
dt dx ' 
dpy dH ^ 



dpx 


dH 


dt 


dx 


dpy 


dH 


dt 


dy 


dpz _ 


dH 


dt 


~dz 



— qPxAok q^Alk . 

Pz = —r- = = sm(ft£ - bjt) + — sm(2kz - 2o)t) . 

dt dz m m 

The boundary condition states, among others, that Px = at time ^ = 0. This will remain 

the case at all times, since Px = 0. The first term in pz is therefore zero, and the force on 

the particle is given by 

„ dpx .dpy dpz ^ q^Alk . 

F = — — x-i- y + z = sin{2kz-2(ot)z. (3.18) 

dt dt dt m 
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The momentum in the z-direction will change accordingly. This is a periodic force on the 
charged particle due to an oscillating potential. Note the doubling of the frequency. 

This example elucidates the most basic operation of a receiver antenna. A wave with 
frequency cj produces a periodic force on a charge q in the z-direction. If we align a thin 
conducting wire to the z-direction, the moving charges create a current that can be mea- 
sured. We will discuss the antenna as a source of radiation in Section 7. 

As mentioned, the Hamiltonian completely determines the dynamics of a system, and 
as such is a very useful quantity to know. In quantum mechanics, we replace the position 
vector r by the operator r and the momentum vector p by the operator p (note that the hat 
now denotes an operator, rather than a unit vector. In the rest of the lecture notes the hat 
denotes unit vectors). This makes the Hamiltonian an operator as well. The Schrodinger 
equation for a particle in an electromagnetic field is therefore 



The vector potential A and the scalar potential O are classical fields (not operators). The 
full theory of the quantized electromagnetic field is quantum electrodynamics. 

The electric and magnetic fields are the physical objects that we usually say "exist" in 
Nature, independent of observers or test charges. On the other hand, the potentials are 
purely mathematical constructs in the classical theory of electrodjmamics. The fact that 
we can change them using gauge transformations without changing any physically mea- 
surable quantities suggests that they are merely convenient mathematical tools. However, 
in quantum mechanics things are more complicated (as always). Since the Hamiltonian 
of a charged particle couples to the potentials, rather than the fields, there are instances 
where a change in the potentials leads to observable phenomena, e.g., the Aharonov-Bohm 




2m 



1 



(p-gA)2 + gO 1^). 



(3.19) 



effect. 
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4 Solving Maxwell's Equations: Electrostatics and Magnetostatics 

We have established that Maxwell's equations (together with the Lorentz force) describe 
all electrodynamics phenomena. However, they are differential equations in the electric 
and magnetic fields, and we need to establish how to solve them given some initial or 
boundary conditions. This is usually a very complicated task, and we can only give a few 
methods for simple problems here. More complicated problems typically require numerical 
methods on computers. I will be following chapter 3 in Griffiths (2008) for this section. 



4.1 The Poisson and Laplace equations 

First we consider electrostatic situations in which the magnetic field plays no role. We 
have seen that this leads to a zero curl of the electric field V x E = 0, and E is fully deter- 
mined by the scalar potential E = -VO. This leads to the Poisson equation 

v2o = -— , (4.1) 
or, in regions of space without charge, the Laplace equation 



Electrostatics is all about solving these equations. In particular, we will be looking at 
solving the Laplace equation in some region of space V where the scalar potential is due 
to some distant charges outside the area of our interest. 

The Laplace equation In one dimension 

In one dimension, the potential O depends only on a single variable, and we can write 

You can immediately solve this very easy differential equation analytically by writing 

0(a;) = ax + b, (4.4) 

where the boundary conditions determine a and b. 

This is pretty much all there is to it in one dimension, but it will be instructive to note 
two properties of this general solution: 

1. At point X, the scalar potential 0(x) can be seen as the average of the two potentials 
on equidistant points on either side of x: 

0(x) = - [0(x -u) + 0(x + w)] for any u in the domain of 
2 



a2 a2 a2 



2. There are no local maxima or minima, as this would invalidate point 1. 
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z 




Figure 16: A solution to Laplace's equation in two dimensions. 

The Laplace equation in two dimensions 

Now we will consider the more interesting problem 

This is a partial differential equation, and it will have much more complicated solutions 
than just flat sloped surfaces, as you may have gathered from the Laplace equation in one 
dimension. However, the two properties listed for the ID equation are still valid in two 
dimensions, and will give us some valuable clues when we try to solve the equation. 

1. At the point ix,y), the scalar potential ^(x,y) can be seen as the average of the 
potentials on equidistant points around (x,y), i.e., a circle C of radius r: 

0(x,y) = ^dl for any r in the domain of O. 

2nr Jc 

An example of such a circle is given in Fig. 16. 

2. There are again no local maxima or minima (as this would invalidate point 1), and 
there are no points where a charge can sit in an equilibrium, stable or unstable. 

A consequence of point 1 (the averaging method) is that the surface defined by f^(x,y) is 
pretty featureless. A physical picture of what comes close to 0(x,y) is that of an uneven rim 
with a rubber membrane stretched over it. The surface of the membrane will generally be 
curved because of the uneven height of the rim (see for example Fig 16), but it also wants 
to be as flat as possible. 
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The Laplace equation in three dimensions 

In three dimensions it is impossible to draw intuitive pictures such as Fig 16, but the two 
Usted properties still remain true: 

1. At the point r, the scalar potential $(r) can be seen as the average of the potentials 
on equidistant points around r, i.e., a sphere S of radius r: 



2. There are again no local maxima or minima (as this would invalidate point 1), and 
there are no points where a charge can sit in an equilibrium, stable or unstable. 

For the proof of point 1, see Griffiths (2008), page 114. 
Boundary conditions for Laplace's equation 

Every differential equation needs boundary conditions to pick out the relevant solution to 
your problem. But how many boundary conditions are sufficient to determine the solution, 
and when do we specify too much? This is an important question, because specifying too 
many boundary conditions will likely result in contradictory requirements, and you won't 
be able to find a solution at all (not even the trivial one, O = 0). To help determine what 
are necessary and sufficient boundary conditions, we state two "uniqueness" theorems: 

First Uniqueness Theorem: The Laplace equation for O in a volume V with surface S 
has a unique solution when O is given on the boundary S. 

This theorem requires knowledge of the potential at the boundary. However, in practice 
we may be able to specify only the charges on various conducting surfaces. In this case the 
second uniqueness theorem helps: 

Second Uniqueness Theorem: The electric field in a volume V is uniquely specified by 
the total charge on each conductor surrounding V and the charge density inside V. 

For the proofs of these theorems, see Griffiths (2008), pages 116-121. 
4.2 Charge distributions near conducting surfaces 

The uniqueness theorems have an interesting consequence, which we will explore by con- 
sidering the following example. Consider a grounded conducting plate parallel to the xy- 
plane at 2 = 0, and a point charge q a distance d above the plane. Since the conducting 
plate is grounded, the potential O is zero. We can consider this a boundary condition for 
working out the potential above the plate: $ = for 2 = 0. At infinity the potential also 
drops off to zero, so we have a completely specified the potential on the boundary of the 
volume we are interested in (the half-space of the positive 2-axis). The first uniqueness 
theorem then tells us that we can uniquely determine O in this half-space. 




for any r in the domain of O. 
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Figure 17: The method of images. 



However, since the boundary potential determines O, any physical situation that pro- 
duces these potentials at the boundary will necessarily produce the right potential in the 
volume of interest. We can consider the different physical situation where the conducting 
plate is replaced with a point charge -q on the 2-axis at the position -d. We can superpose 
the potentials of the two charges and obtain 



1 



(4.6) 



It is easy to see that = for 2 = 0, and O vanishes at infinity. So this situation gives 
the same boundary conditions as the charge above a conductor. By the first uniqueness 
theorem, the scalar potential in Eq. (4.6) is the correct description of the potential above 
the conducting plate. This is called the method of images, since the conductor effectively 
acts as a mirror. The image charge must have the opposite sign to the original charge. 
It also means that the charge is attracted to the plate with a force 



F = - 



(4.7) 



4neo{2d)^ ' 

In turn, this must be due to negative charges in the conductor rushing towards the point 
closest to the point charge. This is the induced surface charge a. The electric field at the 
surface of the conductor is perpendicular to the surface, and from Gauss theorem we have 



E = ^k 

Co 



at 2 = 0. 



From this we find that the induced surface charge is 

dO 



aix,y)^-eo 



dz 



qd 



(4.8) 



(4.9) 



^=0 27r(x2 +3/2+^2)3/2 • 

When we integrate over the surface of the conductor, we find that the total induces charge 
is -g. Did you expect this? 

Exercise 4.1: Derive Eq. (4.7) from the Coulomb force and by calculating the field 
directly from O. 
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4.3 Separation of variables 

One of the most powerful methods for solving the Laplace equation is separation of vari- 
ables, in which we assume that O is a product of functions that only depend on x and y, 
respectively. For example, in two dimensions we can choose 

^ix,y) = Xix)Yiy). (4.10) 

Ignoring complications where X(x) = or Y(y) = 0, we can divide the Laplace equation by 
X(x)Yiy) to obtain 

Xd^'Ydy^ = '- 

Note that these are no longer partial derivatives. For this equation to hold, both terms 
must be proportional to a constant, Ci and C2 with Ci + C2 = 0: 

I'^^^^C, and i!?I = C.. (4.12) 



X dx^ Y dy^ 

If either term were to depend on their corresponding variable x or j we could keep one 
constant fixed while changing the other, thereby breaking the requirement Ci + C2 = 0. 
The Laplace equation in two dimensions then becomes two separate linear differential 
equations 

^=k'X and ^--k'Y, (4.13) 
dx-^ dy^ 

where we chose Ci = -C2 = k^. The solutions to these equations are 

X(x) = Ae*^+Be-*^ and Yiy) = Csmky+Dcosky, (4.14) 

and the potential becomes 

^kix,y) = Xix)Yiy) = (Ae*^ +Se-^^)(Csin% +Dcos%), (4.15) 

for some choice of k. Since the Laplace equation is linear, we can choose any superposition 
of Ok with different values of k to fit the boundary conditions. 

The separation of variables does not have to happen in Cartesian coordinates; we can 
use spherical coordinates as well. These are often better suited to our problem (e.g., when 
we have a spherical or cylindrical symmetry in our system). The Laplace equation in 
spherical coordinates is given by 



1 d ( 2dO\ 1 d 



dr 



I do\ 1 a^o 



dr I r'^&va.O dO 



sine — + 5 ^ = 0. (4.16) 

V del r2sin20#2 
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For the special case of cylindrical symmetry O does not depend on and we have effec- 
tively a two-variable equation. Using separation of variables and setting 0(r, 6) = R(r) 0(0) 
then yields 

1 d ( ^dR\ 1 d ( d@\ 
Rdr[ dr) esin0d0l dd ) 

This leads to two ordinary differential equations 

^{r^^]=l(l + l)R and fsine^l = -Z(Z + 1)0, (4.18) 

dry dr I smO dO \ dO j 

where the constants ±l{l + 1) are chosen such that the solutions to the differential equation 
for R are given by 

R{r) = Ar^ + ^, (4.19) 

and the solutions to the differential equation for are given by the Legendre polynomials 

0(0) = P/(cos0). (4.20) 
These polynomials are defined by Rodrigues' formula 



Pdx)= ^ 



^] ix^-iy. (4.21) 

^ CLX I 



2H\ 

The first few Legendre polynomials are 

PoU) = l, Pi{x) = x, P2(^)= -(3x^-1), P3(x)= -(5x^-3x). (4.22) 

2 2 

These polynomials are orthogonal in the sense that 

C Pl{x)Pi,{x)dx = Y ^ '1^5' (4.23) 
J-i ifZ' = Z. 

Exercise 4.2: Derive a formula for P/+i(x) given Pi(x), P;_i(x), P;_2(x), . . . 

The general solution for the scalar potential in cylindrically symmetric problems is 
now given by 

O(r,0) = - ^ Uy + -i- P;(cos6l). (4.24) 



As an example, we calculate the potential inside a hollow sphere of radius R whose shell 
is held at a potential Ooid) that may depend on the angle 6. First, we note that Bi =0 
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for all I, otherwise the potential becomes singular at r = 0, the centre of the sphere. We 
therefore have 

0(r, 0) = - ^ Air^Pi (cos 6) = Oo(0) ■ (4.25) 

We can see that if the potential is constant only the Z = term contributes, for if we 
include I -1 the cos 6 term must be cancelled by an Z = 3 term, which in turn introduces a 
higher power in cosd, and so on. This means that the potential throughout the sphere is 
constant. In general, we can calculate the coefficients using the orthogonality relation for 
the Legendre polynomials, and we find 

Ai = r ^oie)Piicosd)sin6de. (4.26) 

2i?' Jo 

You should prove this relationship. When we choose ^oiO) = ^sin^(0/2) for some real 
constant W, we can write this in terms of Legendre pol5momials as 

Oo(0) = 1'sin^(0/2)= ^(l-cos0)= ^[Po(cos0)-Pi(cos0)]. (4.27) 
We then find that Aq = W2 and Ai = -W/2R. 

Exercise 4.3: Let ^oid) = for < < 7r/2 and OoiO) = for n/2 < 6 < n. Which I 
contribute to the potential in Eq. (4.25)? 



4.4 Electric dipoles, quadrupoles and higher multipoles 

What if we do not have any special symmetries in the problem? In that case the potential 
will depend on all three coordinates: O - ^{r,9,(p). We will now consider the potential 
of an arbitrary localised charge distribution. Moreover, we are reasonably far away from 
the charge distribution, so we can develop O in powers of 1/r. From the cylindrically 
symmetric potential in Eq. (4.24) you see that all A/ = 0, otherwise the potential would 
blow up towards infinity (what about Aq?). We now include the (p dependence, so the 
potential can be written as 

O(r,0,0)=lf t (4.28) 
eoirom=-z2Z + l 

where the Yimi6,(p) are the spherical harmonics. These are defined in terms of the^ereer- 
alised Legendre polynomials P^icosO), 



Yimie,(p) = ^ ""^- mcosd) e'^'^ for m>0, (4.29) 

V 4:/r (Z + m)! ' 
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and Yi^-mie,(p) = i-irYUe,(p), with 

(_-i\m ( J \l+m 

PTix) = ^-^il-x^r'^ -f (x2 - 1)^ . (4.30) 
* \dxi 

The spherical harmonics also obey an orthogonality relation: 

d(p\ de sine Yi^{d,(l))Y*^,{e,(p) = 5u,6mm', (4.31) 
Jo Jo 

which will come in handy next, when we want to calculate the coefficients Qim in Eq. (4.28). 
The spherical harmonics are also complete, in the sense that any potential $ outside the 
charge distribution can be written in terms of these polynomials, without needing extra 
functions of 6 and (j). The first few spherical harmonics are given by 

l^oo(0,0) 

Fio(0,0) 

yi,±i(0,0) 

^20(0,0) 
Y2,±lie,(P) 
Y2,±2ie,(p) 

Y^o{e,(p) 

You see that these are complex polynomials. 
Exercise 4.4: Verify the Yi^iO, (p) in Eq. (4.32). 

You know from the first year that the potential at a point r = (r, 0, (p) due to a point charge 
q at the origin is given by 

cD(r,0,0)=— (4.33) 

If the charge is not at the origin, but instead at a point r' = (r' ,6' ,(p'), we have to replace 
r in the expression for $ with the distance between the charge and the point where we 



1 



4n 
3 




COS0 




3 z 
An r 



- / 3 . n +i<p _ I ^ x±iy 

+ \l —smO e-^ = +\ 

8n V oTT r 



1671 



(3cos^0-l) 



' 5 3z'-r' 
16n 



- /l5 . „ +^0 _ 15 z{x±iy) 
+ \ — sin0 COS0 e-^ = +\ ^ 



8n 



15 .2n+2i^_ / 15 ix±iy)^ 



32n 



■ sin e~ 



16n 



(5cos"^0-3cos0) = 



3271 

^^5z^-3zr^ 



16n 



(4.32) 
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Figure 18: The potential of a physical dipole, a "perfect" dipole and a quadrupole. 



evaluate the potential: r |r-r'| In addition, we can think of the point charge at r' as 
the charge density p(rO at r' multiplied by an infinitesimal volume around r', denoted by 
dr' . The superposition principle will now let us sum over all charges in those infinitesimal 
volumes (which is an integral over all space), and we obtain 

1 C o(r') 

0(r) = dr', (4.34) 



4neQ 

where the integral over space is given by 

[ dr'= Hr'^dr' C sinO' dd' C"" d(p' , (4.35) 
J Jo Jo Jo 

and p(r') drops off quickly as r' oo. We now have two completely general expressions 
for the potential outside the charge distribution, namely Eqs. (4.84) and (4.28), so we can 
compare the two. In particular, we can solve for the Qim by multiplying both versions of $ 
with Y^^i6,(p) and integrate of 6 and <p. This yields 

Qim= f Y;je,cl))r^pir)dr. (4.36) 

These are the so-called multipole moments of the charge distribution. The first couple of 
multipole moments are the monopole, the dipole, and the quadrupole. You may think that 
the dipole moment Qio is all you need to describe the physical dipole p = qd of two equal 
and opposite charges q a distance d apart. However, this is not true. The dipole moment 
here describes a "perfect" dipole, where d ^ and g ^ oo while keeping p constant (see 
Fig. 18). A similar argument holds for higher-order multipoles. 

Let us calculate the Qim for a physical dipole with a positive point charge q at the 
position (0,0, d/2) and a negative point charge -q at the position (0,0, -d/2). The charge 
density for such a configuration is given by 



p(x,y,z) - q 6(x)d(y)d 



^yqd(x)5(y)dlz + ^y (4.37) 
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where 6{x), etc., are the Dirac delta functions. We substitute this charge density into the 
expression for Qim, and we find that 

Qoo= [ Y*Qie,cl))pix,y,z)dr=^ [ pix,y,z) dr = , (4.38) 
J \/4nJ 



4n 

since the integral over p gives us the total charge, which is g - g = 0. If the two charges 
are unequal, you see that the potential has a net monopole contribution. 
The dipole moments are calculated as 



Qio = j Y^Qie,(l))r pix,y,z)dr = qJ z^5 ^-^j-'^ 



f d 

z + - 



dz 




qd, (4.39) 



and 



Qii = j Y*^i9,(p)rpix,y,z)dr = -^j^jix + iy)[q5ix)diy)-q6ix)6iy)]dxdy = 0. (4.40) 

A perfect dipole would be completely characterised by Qio and Qn, but a physical dipole 
(with finite d) has higher order moments as well. You can show that the quadrupole 
moments are all zero because the Yim are either even powers in z or contain x and y, but 
Qso will not: 



Q30 = W / (52^ - 32r2) q6ix)6iy) 



7 r \ ^ 

q 2z^\6 



16n 



Z + 



z + 



d 



dxdy dz 



dz 



64n 



qd"^. 



(4.41) 



When d ^ 00 for a perfect dipole, you see that Q30 0, as expected. 

Exercise 4.5: Construct an octupole (8 charges in a cubic arrangement) to cancel the 
Q30 of the physical dipole, and make it more like a pure dipole. 



4.5 Magnetic multipoles 

Analogous to the scalar potential O in Eq. (4.34), we can express the vector potential A in 
terms of the current distribution J. At the point r the vector potential A(r) is proportional 
to the current distribution J(r') at point r', and the strength is inverse proportional to the 
distance between the two points |r-r'|. The total vector potential is then the superposition 
of all these contributions: 

Mr) = !^(^dr'. (4.42) 
4n J |r-r| 



and the magnetic field is obtained as usual via B = V x A. To find the multipole expansion of 
A we assume that the current density is nonzero in a small region so that r » r'. Generally, 
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the function l/|r-r'| can be expressed in terms of Legendre poljmomials and spherical 
harmonics (to all orders, and with r' < r) as 



OO ^tl 



— ^ = E-7TT-P/(cosr) 



r-r 



OO I 



(4.43) 
(4.44) 



l=Om=-l 

where r = (r,9,(p) and r' = {r' ,0' ,(p'), and the angle 7 between r and r' is given by 

cos7 = COS0 cos0' + sin0 sind' cos,{(p-(p') . (4.45) 

Since l/|r-r'| is such an important function in electrodynamics, Eqs. (4.43) and (4.44) are 
extremely useful expansions of this function. 



Exercise 4.6: Show that |r - r'l 



+ r'^ - 2rr' cosy. 



The general expansion of the vector potential in spherical harmonics is obtained by sub- 
stituting Eq. (4.44) into Eq. (4.42) 



A(r): 



An J 

/OO L -t 
Z=Oot=-/ M + Lr 



J(rO 

|r-r'| 

00 I 



J{v')dv'. 



(4.46) 



We can collect all the terms with dependence on r' , 6' and cp' into a single integral over r', 
which becomes a vector M^^ depending on I and m: 

M/^ = j (P') r'' J(r', 0', <p')\ r'^ sin0' dr' dO' dcp' , (4.47) 

where we included the Jacobian r'^sind' for integration over spherical coordinates. These 
so-called magnetic moments play the same role as the Qi^ for the scalar potential in 
Eq. (4.36), and the expansion of the vector potential in magnetic moments is given by 



A(r) = )Uo2^ E 



iTom=-/2/ + l r 



l+i 



(4.48) 



Finding A(r) thus reduces to finding the magnetic moments M/^, and for r » r' generally 
only the lowest orders of I contribute. 

We can substitute specific current densities to calculate the vector potential directly 
using Eq. (4.42), but this is often a very difficult integral. So instead we calculate the 
vector potential indirectly via the magnetic moments in Eq. (4.47). Since in physics we 
always deal with finite precision, this is often all we need. We consider two examples. 
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As our first example, let the current density given by an infinitely thin circular wire of 
radius a carrying a current / in the -plane: 

J{r)=-^8{e-^n)8{r-a)4), (4.49) 

We substitute this form of J into the expression for M/^^j to obtain 

Mz^ = ^ / ^('•-«)<^'^'^sin0'dr'(i0'd(^'. (4.50) 

We immediately evaluate the Dirac delta functions, which leads to 

Mim = Ia^ yL[h^<P') 4>'d(P'. (4.51) 

tJ 

We can't go much further with this general solution, so we evaluate the lowest-order mo- 
ments. For Moo this integral reduces to 

-271 

Moo = ^/ cj)'d(P'. (4.52) 



7 r^^ 

V^n Jo 



The unit vector 0' actually depends on (p, so we cannot take it out of the integral (remem- 
ber exercise 1.5 on page 21). Instead, we need to decompose it into i and j components: 

I r^"" 

Moo = ^=/ (-sin0'i + cos0'j)d(/)'. (4.53) 

The unit vectors i and j are independent of (p' and can be taken out of the integral. Now 
we use 

I COS (f)' d(l)' = and | sin (f)' d(l)' = , (4.54) 
Jo Jo 

and we find that Moo = 0- The circular ring current does not have a monopole. Similarly 
you can show that Mio = 0, and we calculate Mi +i next: 



M 



11 



la I Y*Jln,(l)') 4)d(p' = -Ia\ — \ e"^*^ 4)d(p' 
Jo V 8:/rJo 

/ 3 r2n 

-Ia\ — I [cos(])' -i sin (])'][- sin (p' i + cos (p'j) d(p' 

V oTTJo 

/ q p27i 

-Ia\ — I (-COS0 sinc^ i + cos 0j + isin (p i- i cos (f) sincp j) dip 

V oTTJo 



-,aJ±-(iUi], (4.66) 



(4.56) 
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Figure 19: The vector potential A at 2 = of a circular current loop (left), and the corre- 
sponding B field at ^ = (right) to lowest order in I . The region r < r' is excluded, since 
the solutions are not valid there. 



Substituting these values of Mi +i into A in Eq. (4.48) yields, to lowest order 



A(r)«^ 
3 



M 



11- 



+ M 



1,-1" 



Hola sirvd 



8n 



e'1'{ii + j] + e-'1'{-ii + j] 

2 ^■ 



(4.57) 



Since A = A(p4>, the curl of A in spherical coordinates simplifies and the magnetic field 
becomes 



B(r,e,cl)) 



1 



rsin0 



1 



(drA 



[ dr 



(4.58) 



The vector potential and the field are shown in Fig. 19. The central region is excluded, 
because our solutions hold only fro r > r'. To find the field inside the circular loop the 
fraction of radii in Eq. (4.42) must be modified as 



j+i 



,1+1 ■ 



(4.59) 



The solution is then found by following the exact same steps, but the multipole expansion 
may not converge as fast as the one in the far field. 
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The second example we consider here is the field due to a rotating charged spherical 
shell of radius a. The charge density a given by 

p = -^5{r-a), (4.60) 

where Q is the total charge on the sphere. You can see that charge is distributed uniformly 
over the surface, because p does not depend on 6 and (p. The current density can be found 
from J = puj X r, where a; is the angular velocity of the sphere, pointing int he 2-direction. 
This can be written as 

Qo) ^ 
J = pcjxr= r s,m.d 8{r - a) (p . (4.61) 

We again calculate the magnetic moments M-lm. We see that Mqo = Mio = due to the 
absence of cf) in the corresponding spherical harmonics, so we calculate the 1 = 1 moments 
with m = ±1: 

Mii = j Ynid,(l))rJir,d,(l))dr 

^ sine e'"'' sine Sir - a) 4>dr 

8n 4na'^ J 

3 Qa)a^ < ^ ' ^"2^ 



8:71 4n 

3 Qcoa^ ii + 3 
8n~37i 2~ 



j\in^ede^ ''e-^'^^d^J 



(4.62) 



M 



1,-1 




(4.63) 



where we solved the integral over (p in the same way as in Eqs. (4.55) and (4.56). The 
vector potential then becomes 

Yi-iie,(l))\ ^oQcoa^ sine 



A(r) = 



P-o 



M 



11- 



Yuie,ci>) 



+ M 



1,-1" 



12;r2 



0, 



and the magnetic field B = V x A is 



^' = 3 (2 cos r + sine ) . 



(4.64) 



(4.65) 



127r2r3 

The dependence on r, 0, and (/> of A and B is identical to that of the circular current loop, 
since we included exactly the same magnetic moments Mi + i. So in order to simulate the 
field of the rotating spherical shell with a current loop, we require 

, 2 



2>n 



Qco. 



(4.66) 



While it is no surprise that 7 oc Qco, the numerical factor 2/37r is much harder to guess. 
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5 Solving Maxwell's Equations: Electromagnetic Waves 

In the previous section we explored solutions of Maxwell's equations in electro- and mag- 
netostatic situations. Here we will take into account the situation of time-varying fields. 



5.1 Maxwell's equations In vacuum: the wave equation 

In field theories we can often find nontrivial solutions to the field equations (meaning that 
the field is not zero ever5rwhere), even when there are no sources present. For electrody- 
namics, in the absence of charges and currents we can still have interesting effects. So 
interesting in fact, that it covers a whole discipline in physics, namely optics. 
We obtain Maxwell's equations in vacuum by setting p = J = in Eq. (2.30): 

dB 

V-B = and VxE = (5.1) 

dt 

dE 

V-E = and V x B = uqCo — • (5.2) 

dt 

Taking the curl of V x E and substituting V x B = fiocodfE we find 
Vx 

which, with V • E = 0, leads to the wave equation 



vxE + — = v(v•E)-v^E + — 
a^j dt 



dE\ 

^ioeo-^\=0, (5.3) 



v2-)Uoeo^jE(r,^) = 0. (5.4) 

This differential equation has the well-know real and complex plane wave solutions 

E = EACOs(k-r-wi) + EBsin(k-r-toi) and E = Eoe^^''''"''*\ (5.5) 

where E^, E^, and Eq are a constant vectors, k is the wave vector pointing in the propaga- 
tion direction, and co is the frequency. By virtue of Eq. (5.4), the wave vector and frequency 
obey the relation (k^ = k • k): 

k - fiQeod) =0. (5.6) 

Clearly, in SI units (see Appendix B) [iQeo has the dimension of inverse velocity-squared, 
and substituting the numerical values for /io and cq reveals that this velocity is c, the 
speed of light. Another great triumph of Maxwell's theory was the identification of light 
as electromagnetic waves. For the right frequencies, the above equation is therefore the 
dispersion relation of light propagating through vacuum: 

k^^^. (5.7) 



So not only does Maxwell's theory unify electric and magnetic phenomena, it also encom- 
passes the whole of classical optics. On top of that, it predicts a range of new types of 
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radiation from radio waves and infrared in the long wavelengths, to ultraviolet, X-rays 
(or Rontgen rays), and gamma rays in the short wavelength. Prior to Maxwell's discovery 
of electromagnetic waves, the known types of radiation other than light were infrared, 
discovered by William Herschel in 1800, and ultraviolet, discovered in 1801 by Johann 
Wilhelm Ritter. After Maxwell, Heinrich Hertz discovered radiowaves and microwaves in 
1887 and 1888, respectively He also made important contributions to the understand- 
ing of electromagnetic waves. Wilhelm Conrad Rontgen discovered X-rays in 1895, and 
Paul Ulrich Villard discovered gamma rays in 1900. However, it was not until the work 
of Ernest Rutherford and Edward Andrade in 1914 that gamma rays were understood as 
electromagnetic waves. 

Similarly, Maxwell's equations in vacuum give rise to a wave equation for the magnetic 
field, yielding the real and complex solutions 



Since V • E = V • B = 0, the electric and magnetic fields are perpendicular to the direction of 
propagation, k. Electromagnetic waves are thus transverse waves. It is left as a homework 
question to prove that E _L B. 

Exercise 5.1: Derive the wave equation for B by taking the curl of V x B. 

You may have noticed that we also gave a complex solution to the wave equation in 
Eq. (5.5). However, the electric field must be a real quantity, otherwise we would get 
complex forces, momenta, velocities, etc. This is clearly nonsense. The way out of this 
is that we take the real part of Eq. (5.5) as the physical solution, and we discard the 
imaginary part. We may calculate with the complex solution because it is often easier to 
deal with exponentials than with trigonometric functions. 

5.2 The dispersion relation and group velocity 

The dispersion relation for a wave with definite frequency w and wave number k defines a 
propagation velocity. In vacuum, this is c = 299792458 ms~^. In simple dielectric media, 
cq and /Jo are modified io e- e^eo and ju - y-rfJ-o, where Cr, > 1- This has the effect that 
the speed of the wave with frequency o) becomes smaller. The propagation velocity of a 
(plane) wave with a single frequency is called the phase velocity. 

In many materials, the phase velocity is different for different frequencies. Suppose 
that we create a wave packet of light by superposing plane waves of different frequencies. 
For simplicity we consider a wave in one dimension. The wave packet is given by 



where the function y/ik) gives the amplitude of the plane wave with wave vector k. At ^ = 
the peak of the wave function is positioned at x = 0. 



B = B^ cos(k • r - a)t) + Bg sin(k • r - cot) and B = Bo e 



i(k-r-CL)t) 



(5.8) 




(5.9) 



Exercise 5.2: Show that E{x,t) in Eq. (5.9) satisfies the wave equation. 
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xp(/c) 




ko k 

Figure 20: The amplitude y/{k) of the components in the wave packet E{x,t). 



Suppose that y/ik) is smooth and has a single peak at the value ko, as shovi^n in Fig. 20. 
We can then develop coik) into a first-order Taylor expansion around ko: 



d(i) 

0){k) = OJiko) + ik- ko) — 

dk 



(5.10) 

k=kn 



which gives to a good approximation 
w^here coq = a)iko), and we defined 

dco 



(5.12) 



as the group velocity. The reason for this name is that the peak at time t will be concen- 
trated near x = Vgt, so the velocity of the wave packet (a group of plane waves) is given by 
Vg. In free space the group velocity is the same as the phase velocity c. By contrast, waves 
in general dielectrics tj^pically have a frequency dependency in the phase velocity, and as 
a result the group velocity can be different from the phase velocity In specially prepared 
(absorptive) materials, the group velocity can even be greater than the speed of light in 
vacuum. However, the speed of the actual signal will never exceed c. The higher-order 
terms in the Taylor expansion of w(^) that are not included in Eq. (5.10) cause a distortion 
of the wave packet, and the approximation here works only when the second derivative is 
small compared tot he first derivative. The second derivative of cjik) is called the group 
velocity dispersion. 



5.3 Polarization of electromagnetic waves 

The electric field E is a vector, and a natural question is: what is the direction of E in 
an electromagnetic wave? First of all, we have seen that E must be perpendicular to the 
direction of propagation k. That still leaves two transverse directions. Suppose (without 
loss of generality) that the wave vector is oriented in the z direction: k = (0,0,^). The 
electric (and magnetic) field then has two components, in the x- and y-directions. The 
direction of the E field is called the polarization. 
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Figure 21: An electromagnetic wave with linear polarization, a) The E field is pointed in the x 
direction, while the B field points in the y direction. The orientation of the E and B fields determine 
the polarization of the wave, b) The field lines are closed and form interlocking loops. 

The field must satisfy the w^ave equation in each direction, so we have (for a monochro- 
matic wave vi^ith frequency w-ck) 

E = (E^ e'^-'-''^\Ey e'^-^-''^t+'f, 0), (5.13) 

and a similar expression for the magnetic field. Notice the relative phase (p in the y-com- 
ponent: When (p = or n radians (that is zero degrees or 180 degrees), the expression re- 
duces to E = iE^,±Ey,0)e'^-''-''^^, which is an oscillating wave in the direction (£J^,±£y,0). 
This is called linear polarization. When the relative phase (f) is not a multiple of n, we 
speak of elliptical polarization. In the special case of (/> = n/2 or 3n/2, the polarization is 
called circular (and the two possible phases correspond to left-handed and right-handed 
polarization). 

To see what this means physically, consider circularly polarized light (and E^ = Ey - 

E): 

E(r, t) = £:(cos(k • r - cot), + sin(k • r - wt)) , (5. 14) 

where we have taken the real (physical) part of the field. You may recognize this as a 
t5T)ical rotated vector (cos 0, sin 0) in a plane. If we keep r fixed, the E field therefore rotates 
as time passes. The sign in Ey determines whether the rotation is clockwise or counter- 
clockwise. Similarly, we can keep t fixed and vary r along the direction of propagation. 
This shows that E traces out a corkscrew motion in space. 

A couple of things are worth mentioning: There is a difference between changing the 
values of Ex and Ey, and changing the relative phase (p. For waves with linear polarization 
we can always find a reference frame in which either E^ or Ey are zero, but this is not 
possible for elliptical polarization. Second, the polarization is determined completely by 
the E field (we do not rely on the specific form of the B field at all). Nevertheless, the B 
field must be perpendicular to both E and k. In addition, the fields must satisfy Maxwell's 
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equations, which means that for plane waves 



g IkxE 

c k 



(5.15) 



We could have derived the properties of polarization from the B field, however. In that 
case the E field would follow from the expression of the B field. 



5.4 The wave equation with sources 

So far we have set p = J = 0, but in general this is not the case. Following the steps at 
the beginning of this section with arbitrary p and J gives a new set of equations. We start 
again with 



Vx 



5B^ 



d 



VxE+ — = V(V•E)-V^E + — 
5^ j dt 



dE 



(5.16) 



Substituting V • E = pIcQ, this leads to 



(5.17) 



This is a second-order differential equation for E, and does not depend on the magnetic 
field B. Can you reduce this to the Poisson equation for the electrostatic case? (Hint: all 
the time derivatives will be zero.) 

Similarly, we can take the curl of V x B, which gives 



Vx 



aE ^ o d^B 

V X B - /ioeo-^ - jUoJ = V(V • B) - V^B + ^0^0-^ + j"oV X J = 0. 



(5.18) 



Using V • B = this gives rise to the second-order differential in B: 



I 52 x 

v2-;Uoeo^ B = )UoVx J . 



(5.19) 



This differential equation does not contain the charge density p, since static charges do 
not generate magnetic fields, and we notice that the magnetic field depends on the curl of 
J, as expected. 

The two differential equations (5.17) and (5.19) are equivalent to the four linear first- 
order Maxwell equations, but these have the advantage that we have separate equations 
for the electric and magnetic fields. Being second order equations, they need an extra 
boundary condition to specify a particular solution. 
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6 Energy and Momentum of Electromagnetic Fields 

The electric and magnetic fields are very much physical objects in their own right. This 
means that they not only have energy, but also momentum, stress and angular momen- 
tum. 



6.1 Poynting's theorem and energy conservation 

One of the most fundamental principles of physics is energy conservation. It is exact, as 
far as we know, and the question is therefore how it is manifested in electrodynamics. To 
find out we start with a simple vector identity: 

V-(ExB) = B-(VxE)-E-(VxB). (6.1) 

We can substitute the Maxwell equations for the curl of E and B to obtain 

dB dE 

V-(ExB) = -B- — -/ioCoE- — -iUoE-J. (6.2) 

Moreover, the cross product of E and B is related to the Poynting vector 

_ ExB 



)"0 

Putting this all together and simplifjdng the time derivative we have 



(6.3) 



(BB eoE-E\ 

+ -^^^ =0. (6.4) 



d 

V-S + E-J+ — 

dt {2^0 2 

The question is now: what does this mathematical theorem mean? 

Exercise 6.1: Show that the terms in Eq. (6.4) have units ofpower density. 

Since these are densities, to interpret this theorem properly, let's integrate Eq. (6.4) 
over an arbitrary volume V bounded by a surface S : 

d 



Is-ndS+f J-Edy+— f UdV = 0, (6.5) 
Js Jv dt Jv 



where we abbreviated 



^^B^^coE^^B^^ejE^ ,6.6) 

2^0 2 2^0 2 

We again used Gauss' theorem to relate a volume integral to a surface integral. The first 
term is the rate of the energy flow out of the surface S. In the second term we calculate 
how much cargoes move along the direction of the electric field, which is a measure of the 
rate of work done on the charges by the electric field E. We need the current density J, 
because the charges have to be moving^. The final term in Eq. (6.4) is the remaining part 



^The work done on a charge in a time interval At is given hy W -d-F- qv-EAt- J- EAt. 
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of the puzzle, namely the change in energy contained in the fields E and B. The quantity 
U is therefore the energy density of the electromagnetic field. 

This is called Poynting's theorem of conservation of energy The energy can leave the 
fields in a region of space, but it must then be either transported through the surface 
(measured by the Poynting vector), or it must be used to do work on the charges in the 
volume. When we make the volume infinitesimally small, you see that energy conservation 
is not only true globally, but it is true locally, at every point in space. 

6.2 Momentum and stress in the electromagnetic field 

It is not surprising that the electric and magnetic fields carry energy. After all, you're 
quite familiar with the notion that it is the potential energy in the fields that make charges 
move. The general behaviour of the energy in the fields is described by Poynting's theorem, 
discussed in the previous lecture. 

The electromagnetic field also has momentum. There are several immediate reasons 
why this must be the case: 

1. The theory of electromagnetism is a relativistic theory, and we know that energy in 
one frame of reference gives rise to momentum in another reference frame. There- 
fore, to have energy is to have momentum"*. 

2. Last year you learned about radiation pressure, so you know that electromagnetic 
waves can exert a force on objects, and therefore transfer momentum. 

3. When two charged particles fly close part each other with small relative velocity (v « 
c), Newton's third law holds, and the momentum of the two particles is conserved. 
However, when the relative velocity becomes large, the particles will each experience 
a B field due to the other particle's motion. We can configure the situation such that 
the total momentum of the two charges is not conserved (see exercise), and in order 
to save Newton's third law, the field must be imbued with momentum. 

Now let's study the properties of the momentum of the electromagnetic field. Since mo- 
mentum is closely related to force, we first consider the Lorentz force. 
We can write the Lorentz force AF on a small volume AV as 



we now define the force f on a unit volume exerted by the electromagnetic field as f = 
AF/AV" = pE + J X B. Since we are interested in the fields, and not the charge or current 
densities, we want to eliminate p and J from the expression for f. We use the inhomoge- 
neous Maxwell equations for this: 



AF = (pE + JxB)Ay. 



(6.7) 



f =e-o(V-E)E + 



I 1 



dE 



) 



xB. 



(6.8) 



V X B - Co 



dt 



^Even though the actual value of the momentum in specific frames may be zero. 
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Next, we wish to rewrite the term E x B using the chain rule: 

aE „ diExB) „ dB ^^^^ 
— xB= -Ex^- (6.9) 

dt dt dt 

Also, the term (V x B) x B can be rewritten as 

— (VxB)xB= — (BV)B-— VB2 = — (B-V)B + — (V-B)B-— VS^, (6.10) 

where we used the fact that V • B = 0, always. We put this term in to make the expression 
more symmetric. This leads to a lengthy expression: 



f = eo [(V •E)E + (E • V)E] + — [(V •B)B + (B • V)B] - VC7 - 4? • 

jUo dt 



(6.11) 



or in index notation 



fj = ^0 



idiEi)Ej + iEidi)Ej - -djiEiEi) 



+ 



idiBi)Bj + iBidi)Bj - -djiBiBi) 



1^ 

c2 dt 



where U is the energy density per unit volume, and S is the Poynting vector. The first two 
brackets can be written as a divergence: 



fj = di 



EiEj- — 



''E^\ + 



1 



BiBj 



c2 dt 



(6.12) 



We define the quantity in square brackets as the Maxwell stress tensor Tij: 



Tij = eo \EiEj - I + - \BiBj - ^B 



(6.13) 



It is a symmetric rank two tensor and for all practical purposes behaves like a matrix. If 
we understand the divergence operator to act on one index to T we can write 



^ 1 

f = VT 

c'^dt 



or fj = diTij--,dtSj. 



When we integrate this over a volume, the Lorentz force on that volume becomes 

1 as^ 



^-1 

Jv 



VT 



:^ dt 



I Js dt Jv 



(6.14) 



(6.15) 



where da is the surface increment on the boundary surface S of volume V. The tensor T 
therefore has units of force per area, or stress. The first term on the right-hand side is the 
pressure and shear on the volume, while the second term is the momentum transfer to the 
electromagnetic field. 

There is a very intuitive picture for the meaning of the stress tensor in terms of the field 
lines. The stress tensor tells us that the field lines cannot be bent without any cost. They 
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Figure 22: Arcs of magnetic field lines guiding hot plasma. 

have a tension in them that must be overcome. Imagine bringing together two particles 
with equal charge. As you move them closer, the field lines must bend more and more, and 
we have to apply a force to bring them together. But this is just another way of saying that 
like charges repel each other! The opposite situation is also easily explained: two opposite 
charges are connected by curves field lines. Since the field lines want to straighten out, 
the two charges move toward each other to make this happen. 

A much more spectacular manifestation of tension in field lines occurs at the surface 
of the Sun. The magnetic field of the Sun is a giant tangle of field lines that get twisted 
and stretched due to the motion of the material inside. A hot, glowing plasma of charged 
particles spirals around the magnetic field lines and form the beautiful arcs on the surface 
shown in Fig. 22. These arcs explode into space, producing directed streams of high- 
energy charged particles that can knock out satellites and whole parts of the power grid. 
Maxwell's stress tensor explains why this is so. 

The field lines in the arcs shown in Fig. 22 are not all going in the same direction. 
Rather, these are areas where field lines are oriented in opposite directions, creating a 
huge magnetic field gradient across relatively low-tension field lines (Fig. 23a). When the 
gradient becomes too high, the plasma causes a catastrophic breakdown of the field, and 
the field lines "reconnect", as shown in Fig. 23b. However, now the field lines are bent 
to the extreme, and all that tension will cause the lines to move outwards, straightening. 
Charged particles trapped around theses field lines will be accelerated to very high speeds 
and launched into space (Fig. 23c). These are solar prominences. 




Figure 23: Origin of solar prominences. 
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7 Radiation Sources and Antennas 
7.1 Electric dipole radiation 

So far we have looked at properties of electromagnetic waves (radiation) without asking 
how they are produced. Since radiation is an electromagnetic effect, we expect that it is 
created by electric charges. However, both static and uniformly moving charges cannot 
produce radiation (a uniformly moving charge is the same as a static charge in a different 
frame) because there is no characteristic time scale Ci)~^ in the physical system. We there- 
fore need to look at accelerating charges, and to this end we construct an electric dipole 
in which the charge q oscillates between two points separated by a distance s (see figure 
24). The charge at the top of the dipole can be written as Q = g e~""^ and we have a dipole 
p = sqe~^'^^ in the z direction. This is equivalent to a sinusoidal current /oe~""^ 



r 




Figure 24: The Hertzian dipole: Radiation from an electric dipole current. 

We now calculate the scalar and vector potentials in spherical coordinates for this 
situation. At some distance r the scalar potential due to the top of the dipole is given by 

^ir,t') = ^ = = = ^Rir,t), (7.1) 

47i£or 4neor 4n£or 

where t' = t - r/c is the retarded time: It takes a while for the change in the potential at 
r = to propagate to r. To take this into account, we introduced the retarded potential 
^R(f,t), and we used the dispersion relation for radiation in free space k = co/c. 

Next, we add the two retarded potentials of the two charges and = -O^^ in 
the dipole. They almost cancel, but for the small separation s in the z direction, which 
gives rise to 6r in the r direction: 

Oi?(r, t) = 0<j+' + O^-^ = (D5j+ V, t) - o5j+ V + 5r, t) - -6r (7.2) 
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Figure 25: The directions of the E and B fields of a spherical wave radiated by a dipole in the 
centre. After half the oscillation period the fields point in the opposite direction. 



Substituting Eq. (7.1) and 6r = scosO we obtain 

^Rir,t) - -scosd 



dr 



PrU cosd I 1 



AncQr 



(7.3) 



where pR is the retarded dipole moment. 

The vector potential of this oscillating dipole in spherical coordinates is 



fiQ r JR(r,t) 



An J r AneoC^r AncQC^r 



1 



Irs, 



(7.4) 



where s is the dipole vector s(cos0r- sin00), parallel to the z axis, and Ir is the retarded 
current in the dipole. Using the relations Q = qe~^'^^ and I = dQ/dt, we can write the 
dipole moment pr in the scalar potential in terms of Ir: 



We therefore have 



^R(r,t) 



Irs cos 6 
Aneocr 



1 + 



kr 



IS 

PR = —IR' 
(ji) 



and AR(r,t) 



lRs(cos6r-sin66) 
AncQC^r 



(7.5) 



(7.6) 



The electric field E is given by E = -VOij -Ar. Using the derivatives in spherical 
coordinates we find 



Erir,t) 



.i X kiRS cosd 

(AR)r 



dr 



AncQcr 



( 2 2i 

+ 
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r 




Figure 26: Radiation from a magnetic dipole current. 



laojj . kiRS sine 

Ee{r,t) = —-iAR)e 



39 AncQcr 

We already know that radiation carries energy, so the total energy flux through a closed 
surface around the source is constant. If we take the surface to be that of a sphere with 
radius r, the energy flux Anr'^{e\E\'^l2 + |B|^/2;Uo) must be constant for all r. The radiating 
part of the E and B fields are therefore proporional to 1/r. Looking at Eq. (7.7), we see 
that the only contribution to the radiation is due to the i term inEQ. Similarly, the only 
contributing term of the B field is (see exercise) 

.klRSsind Ee(r,t) 

BAv,t) = -I— 5— = , (7.8) 

AneQC^r c 

where ^^^(r, t) now denotes the radiating part of the E field. Poynting's vector is given by 



ReEe x ReB^ E x B* 1 
- Re- 



(7.9) 



Note that we can calculate the Poynting vector by taking the complex conjugate of the B 
field and take the real part of S afterwards. Again, it is often much easier to calculate 
with complex phase factors of the form e'''^ than with the trigonometric counterparts sin^, 
cos^ and tan (p. 

7.2 Magnetic dipole radiation 

Now suppose we have a small current going in a square loop of area s^, leading to a 
magnetic dipole moment m = s^Iq e~"^^ The current is changing in time in a periodic way. 
Again, we can define the retarded magnetic dipole by substituting t t-r/c. If the current 
has a low impedance the scalar potential is zero, while the vector potential is 



^oniRksind 

ARir,t) = 

4:nr 



~ + i\cf>. (7.10) 
kr 
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We can again field the radiating parts of the E and B fields via E = -A and B = V x A: 

^ mSp^^.^Be. (7.11) 

and the Poynting vector is 

{ksflsirvd 



S _j^^. ExB* _ 1 
The difference between Sg and S;„ is a factor {ksi^. 



(7.12) 



7.3 Radiation from accelerated charges 

Clearly, radiation is generated by accelerated charges and changing currents. In the pre- 
vious sections we have looked at periodically changing charges and currents, but in many 
situations the charges and currents change in an aperiodic way. For example, when a 
charge is kicked, it will emit a burst of radiation. We will now consider the question of 
how much power P is radiated when a charge is accelerated by an amount a. This will 
lead to Larmor's formula. 

Suppose a particle with charge q follows a trajectory Vqit). We can formally associate 
a dipole moment p to this particle: 

p(t) = qrq{t). (7.13) 

The current due to the motion of the charge is clearly related to the velocity v^, so we can 
write 

The acceleration of the particle is proportional to the second derivative of the dipole 
moment: ^ 

^ = ga,(0. (7.15) 

Next, we calculate the vector potential and the magnetic field B of the moving charge. 

First, we note that the current I is the spatial integration over the current density, so 
we have 

\( ^\ 1^0 f J, , ^ , , ^0 T Ho dp(t-r/c) 

A(r, t) = - — / J(r , ^ - r/c)dr = - — Ir = — , (7.16) 

4nr J Anr Anr dt 

evaluated at the retarded time, because the field needs time to propagate from the charge 

to the point where the vector potential is calculated. 

The magnetic field is found using B = V x A. We need to keep in mind two things: 

(1) We can ignore all the contributions that do not scale like 1/r, because they do not 

propagate. (2) We must remember that the time variable depends on r because we consider 

the retarded time. The magnetic field is then 



B(r,0 = VxA(r,0=7^ 



l\dp 1 dp 
\r dt r dt 



^°-Vx^, (7.17) 



4jir dt 
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where the last equaUty holds because V x (l/r) leads to a non-propagating contribution to 
the field. The calculation of the curl of dp/dt is a bit tricky, so we'll do it here in some 
detail. In particular, it is convenient to use index notation: 

Using the chain rule, we can write (t = tq- ric with ) 

dt d Ixj d 
oxj dt c r dt 

which allows us to rewrite the cross product: 

(d^p\ 



dp\ 1 X, 



[dt^l, 



(7.20) 



In vector notation, this becomes 



dp 1 d^p 1. d^p 



c 



and therefore the B field can be written as 



B{r,t)=- — Vx — = -- rx—- = -- rxa^, (7.22) 

Anr dt Ancr dt'^ Ancr 

evaluated at time t, and due to an accelerated charge at the retarded time t - rIc. 

The radiating part of the electric field E must be perpendicular to the radiating part 
of the magnetic field that we just calculated, and they differ by a factor c. Keeping the 
correct handedness, the radiating E field can be written as a cross product between B and 
r: 

E(r,0 = cBxr = -— (rxaJxr. (7.23) 
47rr 

Now that we know the E and B fields, we can calculate the PojTiting vector S, which gives 
the energy flux through a surface. If we take the surface to be a sphere with radius R 
surrounding the accelerating charge, we can calculate the total radiated power 



SRR^dn, (7.24) 



where dD. is a solid angle. 

The Poynting vector is straightforward to calculate: 

S = ^ = 7#T^ [(r X a^) X r] X (r X a,) . (7.25) 
Using the vector identity A x (B x C) = B(A • C) - C(A • B) we rewrite this as 
^ = 16;r4e3r^ [a, -r(a, -r)] x (r x a,) 
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{r [lag f - {3iq -vf]- 3iq [3iq ■ V - 3iq ■ v)] 



.2 



^ r 2 / -^\2i 



21^ 



r, (7.26) 



where we defined aq = \3Lq\ and we oriented our coordinate system such that the accelera- 
tion is along the z direction. The inner product -r is therefore equal to a q cos 6. 

The radiated power through a sphere surrounding the accelerating charge is now 
straightforward to calculate: 

P=isRR'da=j J RUn = ^^^ sin'edod^. (7.27) 

The integral over sin^0 is equal to 8n/3, so we arrive at 

1 VaJ 

P = - (7.28) 

This is the celebrated Larmor formula, derived by Joseph Larmor in 1897. 



page 62 



PHY381 : Advanced Electrodynamics 



8 Electrodynamics in Macroscopic Media 

So far, we have considered the fields of (moving) charges and the interactions between the 
charges. In principle, this covers all classical electromagnetic phenomena, since all matter 
is made up of charged particles that interact with each other. However, if we are dealing 
with 10^^ atoms we don't want to solve the Maxwell equations for every individual atom. 
Especially when the time and length scales relevant to our problem are large compared to 
that of the atoms, we much rather average over the atomic fields and have effective fields 
in the media. 

The electric and magnetic fields in Maxwell's equations are now average fields <E> 
and (B>, and the charge and current densities <p> and (J> now consist of free charges 
and currents as well as bound charges and currents. Maxwell's equations in macroscopic 
media then become 

V-<B> = and V x <E> + = (8.1) 

at 

V-<E) = ^ and Vx<B)-)Uoeo^=i"o<J). (8.2) 
cq at 

The charge and current densities are 

{p) = Pf + Ph and {J) = Jf + Jb (8.3) 

where the subscripts f and b denote "free" and "bound", respectively. 

The fields in the materials are no longer the bare E and B fields due to freely moving 
charges and currents (as was the case for the microscopic Maxwell equations), but they are 
modified by the presence of the bound charges and currents. We want to relate the mod- 
ified fields to material properties (the bound charges and currents) and modify Maxwell's 
equations such that they involve the macroscopic fields and the free charges and currents. 

8.1 Polarization and Dispiacement fieids 

First, we consider a material that has bound charges. These are, for example, electrons 
that cannot move freely through the material, but that are bound to the parent molecule. 
When an external electric field is applied, the negative electrons move with respect to the 
positive nuclei, creating a small dipole moment. This dipole moment is aligned against the 
external field, because the external field pulls the charges such that the local field becomes 
smaller. All these little dipole moments create a macroscopic field P, which is related to 
the bound charge density via 

P6 = -V-P. (8.4) 

This is in some sense the definition of P. We must now relate the P to the dipole moments 

of the molecules. 

Let V be the volume of the medium, and the dipole density in the i direction is pb^i. 
Then 

r ^ r dP ■ ^ c rjonS /. 

Kipbdr^-Y^ n-^dr^-Y^ riPjd^rk^j + Ptdr, (8.5) 

JV j=iJv Orj j^^Js rjonS JV 



Section 8: Electrodynamics in Macroscopic Media 



page 63 



^ ^ 

O O cH^ 

Uniform polarization 



(0 ^ 

(9 

(E^ 
^ ^ 
^ ^ ^ ^ 

Nonuniform polarization 



Figure 27: Polarization in a medium 



where the first equaUty comes from Eq. (8.4), and the second comes from partial intergra- 
tion. The surface term is zero when we take the surface just outside the medium. In vector 
notation, we therefore have 

f rp6dr= \ Pdr = ^p„, (8.6) 
Jy Jy y 

where the last term is the sum of all microscopic dipole moments in the volume Y . Outside 
Y we have P = 0. 

Now that we know what P means, we can rewrite the Maxwell equation V • (E) = (p)/eo 
using Eqs. (8.3) and (8.4)as 

pf + Pb Pf 1 
V-(E) = ^ =— V-P. (8.7) 

eo Co eo 

Considering we want the macroscopic field in terms of the free charges, we can define the 
macroscopic field as the displacement field D: 

D = eo(E) + P (8.8) 

All the material properties are determined by P. The corresponding macroscopic Maxwell 
equation is then 

(8.9) 



V-D = p/- 



Often the polarization field is directly proportional to E, and we write P = x^o^, where 
X is the susceptibility of the medium. The displacement field then becomes 

D = eo(l + x)E = eoerE = eE. (8.10) 

Assuming for the moment that the permeability po does not change, one can see imme- 
diately that the wave equation yields a propagation velocity c^/cr- In ordinary materials, 
we therefore have > 1. As we will see in the next section, this is directly related to the 
index of refraction. 
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Figure 28: Magnetization in a medium. 
8.2 Magnetization and l\/lagnetic induction 

Now we will construct the macroscopic magnetic field using the free and bound currents. 
Apart from the free currents in the material, there may be bound currents that are con- 
fined to the molecules. In quantum mechanics these bound currents can be the spins of 
the electrons and nuclei, but we'll keep things classical. 

The bound currents may be due to a change of bound charges, which can be written 
as dP/dt. Alternatively, they may correspond to little current loops, which give rise to a 
microscopic magnetic dipole moment. Under the influence of an external field B, these 
little magnets can line up to form a macroscopic field that we shall denote by V x M. We 
then have the following expression for the bound current: 

dP 



+ VxM (8.11) 

dt 



The divergence of the bound charge is 



^•Jfc = ^^ + '^-(^^M) = -^ = 0, (8.12) 

dt dt 



since the bound charge is conserved. 

Now let's consider the second inhomogeneous Maxwell equation 



5<E) _ d 

V X (B) - i^oeo—— = V X (B)-^oeo — 
dt dt 



---] = /^o<J> = Mo(J6+J/). (8.13) 



This leads to 

dB dP dP 

V X (B) - luo— + iiQ— = iiQ— + iuqV xM + idoJf . (8.14) 

We can now define the magnetic field H in terms of the magnetic induction B and the 
macroscopic magnetization M as 

H=— <B)-M, (8.15) 
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yielding the Maxwell equation 

(8.16) 



VxH ^Jf 

dt ' 



The homogeneous Maxwell equations remain unchanged, since here E and B are fun- 
damental (that is, no material properties are present in these equations). Also, for the 
macroscopic case it is understood that all fields are averages, and we drop the brackets <.). 
The macroscopic Maxwell equations then become 

V-B = and VxE + — = (8.17) 

dt 

V-D = p^ and VxH- — =Jy^. (8.18) 
8.3 Waves in macroscopic media 

When waves penetrate a medium, the molecules in the medium respond to the electric field 
and the magnetic induction to form polarization and magnetization fields. Let's assume 
that the medium is electrically neutral, so that there are no free charges (p = 0), but there 
may be electrons in a conduction band that can produce free currents J. These electrons 
presumably respond directly to the electric field, such that the free current is propotional 
toE: 

J = o-E (Ohm's law). (8.19) 

This is Ohm's law, and the constant of proportionality a is the conductance. This law has 
a wide range of applicability, but you should be aware that there are many interesting 
situations where Ohm's law does not hold. 

There are essentially three regimes of interest for a: (1) a is small, and the medium is 
a dielectric; (2) a is large, and the medium is a conductor; or (3) a is imaginary, and the 
medium is a plasma. We will derive the index of refraction for dielectrics, the skin depth 
for conductors, and the plasma frequency for plasmas. We will also derive the reflectivity 
of these three types of materials. 

Waves in dielectrics 

First, we rederive the wave equations from the Maxwell equations, but this time we keep 
the J term. We find the following differential equations for E and B: 

V^B-jue— ^ = -)uVxJ, (8.21) 

where e = e^eo and p = [J-r^o- Using Ohm's law, this leads to 

o a^E dE 
V^E-,.^-,.- = 0, (8.22) 

9 d^B dB 
V^B-iM--^-l^a— = 0. (8.23) 
dt'^ dt 
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We substitute again the plane wave solutions for waves in the positive z direction E 
^^^i{kz-oyt) g ^ Boe'^*^"'^^^ and we immediately find that 



k = yeuo)^ + iojua = ko^/Il^\ l-\ , (8.24) 

* y we 

where ko = a)/c. You can see that the wave vector k is complex! 

Now we can look at the three different regimes for a. If the conductance is small 
(a/oje « 1), the medium is a dielectric and the imaginary root is approximately equal to 
1. The wave vector is real, so the waves propagate freely. However, the value of the 
wave vector is modified by the factor ^e^jUr. It turns out that this is indeed the index of 
refraction, at least for apolar materials^. 

For ordinary dielectrics where D = eE and B = /iH, the ratio between the electric field 
and the magnetic induction is given by the velocity of the wave in the medium: 

E c 

- = . (8.25) 

In air, this leads to approximately E = cB, and the (average) Poynting vector in a medium 
is then 

Say = ncQcE^,^,, (8.26) 

where n is the index of refraction, and -Efuis average strength to the electric field 

squared. 

Waves in conductors 

In a conducting medium a is very large, and we can ignore the +1 term in Eq. (8.24). The 
wave vector k becomes 



V (i)eo 



Since the wave propagates in the positive z direction, we choose the positive sign. We 
defined the skin depth 6 as 

6^ J—. (8.28) 

When we substitute this value for k into the solution for the E field we obtain 

i(J-io^)]exp(-|), (8.29) 



E(2,^) = Eoexp 



that is, there is a propagating term and an exponentially decaying term, which falls off to 
when the wave has reached the skin depth. 



^In apolar media the distribution of the electrons in the molecules is fast enough to follow the oscillating 
fields. In polar materials, however, the response of the medium is slow since the polar molecules have 
to re-orient themselves. This is the principle behind the microwave: The water molecules try to realign 
themselves with an oscillating microwave field, and as a consequence the water gets heated. 
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The magnetic field is given hy B = kE/o), where now k is complex. With the time 
dependence made explicit, we find that 

B = [^Eoe'^'^^-''*^"''^^e-''^ . (8.30) 
V (0 

Note the n/4 phase lag of B with respect to E. Inside the metal we have 

Bo = .f^Eoe'"'^ (8.31) 
V (0 

and the Poyning vector is 



This shows that the power falls off twice as fast as the fields. 
Waves in plasmas 

Finally, consider a plasma in which the electrons are disassociated from the atomic nuclei. 
When an electric field is applied the electrons accelerate. The current density J therefore 
lags the electric field, and the phase happens to be e*'^''^ = i. The nuclei are much heavier 
than the electrons, so they do not accelerate as fast and make a negligable contribution to 
the conductivity. We can find the velocity v of the electrons by integrating F = ma = eE, 
where all vector quantities are in the z direction. 



m J 



dtEoe'"'^* = —E, (8.33) 



where we explicitly wrote the time dependence of E. Using the electron density Ne, the 
current density is 

ieE J ie^N 

J = Ne\ = Ne , and (r = - = . (8.34) 

corn E o)m 

We substitute this into our expression for the complex wave vector k 



k = ko\ 1- 



e^N / (CDp^^ 
= ko\ l-[^] , (8.35) 



where we defined the plasma frequency 



e^N 

0Jp = \ . (8.36) 

V f^^o 



Clearly, when the frequency of the wave (o is larger than the plasma frequency, k is real 
and the plasma acts as a dielectric. When axojp the plasma acts as a conductor. 
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8.4 Frequency dependence of the index of refraction 

The very simple description of waves in dielectric media with constant e and ju cannot be 
the whole story. We know that in real materials the index of refraction has some frequency 
dependence, or the phenomena of prisms and rainbows would not exist. We will now give 
a simple model of dielectric materials that leads to such a frequency dependence. 

Imagine that the material is made up of atoms whose bound electrons respond to the 
electromagnetic wave. The electrons will follow the electric field and will oscillate in, say, 
the x-direction. Since the electrons are bound (and the wave intensity is relatively low), 
we can model the electrons as sitting in a harmonic potential. The equation of motion for 
the position x of each electron is then given by 



d X dx 



m 



'-^-mr'-^ + mo)ix = qEoe~''^\ (8.37) 
dt'^ dt 

where m is the mass of the electron, q its charge, wq is the resonance fi'equency of the 
electron, and 7 is a damping factor that will siphon energy from the wave in a process gen- 
erally called dissipation. The driving force on the electron is given by the electromagnetic 
wave with frequency o) and amplitude Eq. Note that we are treating the atoms in a purely 
classical way, and not quantum mechanically. This is sufficient for our purposes here, but 
it is of course not a true description of the dielectric material. 

It is easy to verify that the solution to this differential equation is given by the complex 
position function 

q Eq e-''^^ 

x(t)=^ 2 9- • (8.38) 
The dipole moment of the electron then becomes 

p(t) = qx(t)=^ „ \ . ■ (8.39) 

m (0^-0)"^ - lyo) 

Since this is an oscillating dipole it will generate radiation at the driving frequency 10. 
The driving wave is propagating in one direction and the dipole emits in a radial direc- 
tion, so this amounts to losses in the energy of the wave. The polarisation field of the 
dielectric medium is given by the density of dipoles in the material, and we can define the 
polarisation field as 

Nq^ 1 

P=-^^ 2 ^^E, (8.40) 

m (Oq-o)^- ijco 

where N is the density of electrons with resonance frequency ojq. If there are multiple 
resonance frequencies in the material we add them to the polarisation field with the corre- 
sponding density. In this case, both P and E are complex, and they obey the relationship 



P = ieoE, 



(8.41) 
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Figure 29: Index of refraction n-1 and absorption coefficient a near the resonance frequency ojq 
of a simple dielectric material. 



as before. The complex dielectric constant is then given by 

/ Na^ 1 ^ 

e = eo{l + x] = eo 1 + — 5 • (8-42) 

\^ m - - ijo) j 

The wave in the dielectric medium therefore has a complex wave vector k - cj^/eji. If 
we assume that ^ = y-Q and that N is relatively small, we can use the approximation 
vTTx a 1 + ^x, and we find 



c 



In turn, the index of refraction is given by the real part of ck/co, which is given by 

cReik) Nq^ 
^ = = ^+0 ^ TT- ^8.44) 

The imaginary part of the wave vector causes an exponential decay of the wave amplitude, 
and is therefore a measure of the loss of energy in the wave. Since the intensity of the wave 
is proportional to E^, the absorption coefficient a is given by 

a = 2Im(^) = ^ — . (8.45) 

The index of refraction and the aborption coefficient are shown in Fig. 29. 

You see that the index of refraction n dips below 1 at frequencies higher than the 
resonance frequency. This means that the phase velocity will be larger than c! However, 
it is the group velocity that measures the velocity of the energy in the wave, and this will 
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still be lower than c. In the limit of negligible damping, far away from the resonance 
frequency, the group velocity is 



do) _ 

1 + 



2meo co^-iOq 



The region where the index of refraction drops below one is called anomalous dispersion. 
Exercise 8.1: Verify the formula for the group velocity in Eq. (8.46). 
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9 Surfaces, Wave Guides and Cavities 

Electric and magnetic fields are often confined in waveguides or cavities. In this section 
we will determine the boundary conditions of the fields at the surface between two ma- 
terials. We then consider waves confined in one dimension, which leads to the concept of 
reflection and transmission. Waves confined in two dimensions create waveguides, and 
three-dimensional confinement of waves leads to the concept of cavities. 



9.1 Boundary conditions for fields at a surface 

The boundary conditions for the fields E, D, B and H are entirely determined by Maxwell's 
equations in macroscopic media. In particular, each Maxwell equation gives a boundary 
condition. Let the surface between two materials (1) and (2) be parallel to the -plane at 
2 = (with material 1 in the region z < 0). We start with the equation for the displacement 
field 



V D 



Pf- 



(9.1) 



The component that is normal to the surface then obeys dxDz = Of, where af is the 
free surface charge. We can compare on either side of the surface at z = ±e, such that 



Df{E)-Df{-e) 



k = ADi . 



(9.2) 



Note that is a vector, while Dz is just a component. When there is no free surface 
charge (a/- = 0) the normal part of D is continuous (AD^ = 0). In general, the boundary 
condition for D is that 



ADi = o-^. (9.3) 

This boundary condition determines AE^ via the constitutive relation between D and E. 
The boundary condition for E|| is determined by Maxwell's equation for V x E 

VxE = . (9.4) 

dt 

We can integrate this equation over a surface S that intersects the boundary between the 
two materials, and use Stokes' theorem to turn the surface integrals into line integrals: 

^E-dl = -J^-da. (9.5) 

Next, we take the contour as a rectangle perpendicular to the surface and sticking half- 
way into the surface (see Fig. 30). The parallel sides of the loop have length I, and the 
sides perpendicular to the surface have length IS.z. We therefore have 

Z(Ej|^>-E[f') = -ZA2^ = 0, (9.6) 
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Figure 30: Transverse boundary conditions on a surface. 

where the last equahty follows from taking the limit Az 0. We therefore have the second 
boundary condition 

AE|| =E[|"-E[^^ = 0, (9.7) 

which determines AD|| via the constitutive relation between D and E. 

The boundary condition from the equation V • B = follows along the same lines as 
that for V • D, but this time there is never a surface charge (since there are no magnetic 
charges). We immediately deduce that 

ABi = 0. (9.8) 

This determines AH^ via the constitutive relation between B and H. 
Finally, we consider the Maxwell equation 

dD 

VxH= — +Jf. (9.9) 
dt 

Integrating over a surface intersecting the two materials and using Stokes' theorem then 
yields 

jB.dl = J^-da + Kf, (9.10) 

where Kf is the free surface current. Taking again the contour shown in Fig. (30) we find 
that 

AH.\\ ^Kf. (9.11) 

Again, AB|| is determined via the constitutive relation between B and H. In summary, the 
general boundary conditions for the electromagnetic fields at a surface are given by 

AE|| = and AD_i = af, 

ABi = and AH||=i<:^. (9.12) 
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9.2 Reflection and transmission of waves at a surface 

One of the most common phenomena is the reflection and transmission of waves at a 
surface, for example light travelling through glass or reflecting off a mirror. We will now 
calculate the reflection and transmission coefficients for electromagnetic waves hitting 
dielectric and conducting surfaces. The fields of the incoming waves carry subscript /, the 
transmitted wave fields carry subscript T, and the reflected waves carry subscript i?. 

Dielectrics 

For the reflection and transmission of waves at the boundary between two simple di- 
electrics we use the constitutive relations 



D = e^^^E and H = 



1 



0) 



B, 



(9.13) 



where j = 1,2 denotes the medium. In addition, we can determine the direction of the B 
field by constructing B = k x E/a> and noting that the frequency is the same in the two 
media. We can furthermore select the normal and transverse components by taking the 
dot product and the cross product with the normal vector n. The boundary conditions in 
the absence of surface charges and currents then become 



AB;, 



,(2)i 



•n = 



k/ X E/ + kj? X E/j - kr X e^^^Er 



•n = 



Ej + Ei? -Er 



xn = 



;jl)k/xE/ + ^k«xE«--^krxEr 



xn = 0. 



(9.14) 



When a beam is hitting the surface at a 90 degree angle, the normal vector n is always 
perpendicular to the fields (since electromagnetic waves are transverse waves). In this 
special case we only deal with AE^ = and AH^ = 0. For a dielectric with Cr and jir = 1, we 
write k/ = -ki? = ^/e^T, and the boundary conditions become 



Ei + Er= Et from /SEt = 



E 

E/-Eij = — ^ from AHf = 0. (9.15) 

Eliminating Ey, we have 

1^ = ^ = ^. (9.16) 

El n + 1 

The reflectivity i? of a surface can be defined as the ratio of the reflective and incident 

Poynting vectors. Since the Poynting vector is proportional to in a dielectric medium 
(B = nkElo)), the reflectivity becomes 
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Figure 31: Waves incident on a dielectric. The waves propagate in the dielectric, (a) The electric 
field is mostly transmitted, and the reflected wave experiences a phase shift, (b) The magnetic field 
is mostly transmitted, and the reflected wave does not experience a phase shift. 



For glass, the index of refraction is ai = 1.6, so the normal reflection is about ten percent. 

Note that the reflected B field picks up a relative minus sign. This is because the 
Poynting vector must point in the opposite w^ay to the incident w^ave, and from Eq. (8.25) 
we know that the magnetic induction must gain in relative strength in the medium. 

Exercise 9.1: Calculate the reflectivity for a wave coming in at an angle 6 to the normal 
of the surface. 

Conductors 

The reflectivity of metals is high, and consequently the transmittance T is low (R + T = 1 
from energy conservation). Using again B = kE/co, and the boundary conditions of the 
previous section, we once more use 

Bt 

Ei-Er=Et and Bi + Br = —, (9.18) 



and find 

Et _ 2 ^2 
Ej 1+n n 
where n is now a large complex index of refraction 



(9.19) 



n^X-^. (9.20) 

V (^^Q 

In addition, we can solve Eq. (9.18) for the magnetic induction, which yields 

Bt 2 

— = ^2. (9.21) 

Bi 1 + lln 

The ratio of the Poynting vectors for the transmitted and incident waves is therefore 

St 4 

— ^- = Re-. (9.22) 
Si n 
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Figure 32: Waves incident on a metal, (a) The electric field is mostly reflected with a phase shift, 
(b) The magnetic field is mostly reflected, and does not have a phase shift. There is no propagation 
of the wave in the metal, and the exponential fall-off of the amplitudes is characterized by the skin 
depth. 



This leads to the transmittance 



a 



(9.23) 



Usually, the transmitted vs^aves will be absorbed completely unless the metal is sufficiently 
thin. When cr ^ oo such as in a superconductor, the transmittance drops to zero and 
everything is reflected. 



9.3 Wave guides 

In this section, we consider waveguides with conduction walls. Since there are no fi^ee 
charges or currents inside the waveguide. Maxwell's equations can be written as 



V-E = 



dB 

VxE = 

dt 



VB = 



1 dE 

VxB= ^ — . 
c2 dt 



(9.24) 



We are interested in the behaviour of monochromatic waves of frequency to and wave 
number k inside the waveguide, so we assume that the fields take the generic form 



E = Eoe 



ikz-i(i)t 



and 



B = Boe 



ikz-iojt 



(9.25) 



In terms of the components Ex, Ey, Ez of Eq and B^, By, Bz of Bq, the two equations for 
the curl become 



dEy 


dEx 


dx 


dy 


dEz 


dEy 


dy 


dz 


dEx 


dEz 



dz dx 



icoBz 
i(i)Bx 

i(jl)By 



and 



dEy 


dEx 


dx 


dy 


dEz 


dEy 


dy 


dz 


dEx 


dEz 



dz dx 



It) 

—^Ez, 

7-, 

iw 

^Ey. 



(9.26) 
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Figure 33: A rectangular hollow wave guide with conducting walls. 



Note that we do not set co = ck, since the walls of the waveguide may impose a different 
dispersion relation. We can express E^, Ey, B^, and By in terms ofE^ and B^, yielding 



Ex 
E 
B 
B 



i 
i 
i 



k 



k 



dE, 

dx 
dE, 

dy 
dB, 

dx 
dB, 

dy 



+ 0) 



dB, 
dy 
dB, 

0) 

dx 
(D dE 



dy 



+ 



ii) dE, 



dx 



(9.27) 



Substituting these equations into V-E = and V-B = then gives the differential equations 

^2 ;,2 



+ ^ + (-1 -k^ 



dx^ dy^ ^ c ' 
d'^ 

dx^ dy^ V c 



E, = 0, 



B, = 0. 



(9.28) 



The other components of Eq and Bq can then be obtained from Ez and Bz by differentiating 
according to Eq. (9.27). 

Exercise 9.2: Derive Eq. (9.28). 

Since the waveguide is oriented in the ^-direction, one would ordinarily expect the 
longitudinal Ez and Bz components to vanish. However, inside a waveguide we can have 
the situation that the wave is not travelling straight, but bounces off the surfaces at an 
angle. In this case Ez and Bz are not necessarily zero. We classify electromagnetic waves 
in a waveguide by theire transverse properties. A transverse electric (TE) wave has no Ez 
component, and a transverse magnetic (TM) wave has no Bz component. If both Ez = 
and Bz = 0, we speak of TEM waves. These cannot occur in a hollow waveguide, but the 
can occur in waveguides with a conducting core, such as a coaxial cable. 
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The rectangular hollow waveguide 

As an example, consider TE waves in a rectangular waveguide of sides a and h <a shown 
in Fig. 33 with perfectly conducting walls. This means that the fields inside the walls are 
zero, and the boundary conditions AE|| = and AB^ = become 

E||=0 and Bi = 0. (9.29) 

There will be induced charges and currents on the surface that are hard to calculate, so 
we see how far we can get with these two boundary conditions. We have to solve Eq. (9.28) 
with = 0, and we proceed by separation of variables with the Aresate Bz{x,y) = X{x)Y(y), 
such that 



2v ^2v 

" XY = 0. (9.30) 



Y^+X^ + {—-kA 



As we have seen, this is equivalent to solving the two ordinary differential equations 

1 d^X 9 1 d^Y o 

kl and —^^ = -kl, (9.31) 



X ^ Y dy 

subject to the constraint 



-kl-kl + '^-k^ = 0. (9.32) 

The standard solution is 

X(x) = Asin(kxX) + Bcos(kxX) and ^(y) = Csin(^y3/) + Z)cos(^3,3/). (9.33) 

The boundary conditions require that = at x = and x = a, and By = Oaty-0 and 
y = b. To find the effect of the boundary conditions on A and B, we substitute Eq. (9.33) 
into Eq. (9.27) and set B^ = 0. We find that dX/dx = 0, and therefore A = in Eq. (9.33). 
This leads to specific values ofkx and ky: 

mn , , nn 

kx = and ky = —, (9.34) 

a h 

where m and n are integers. The solution for B^ then becomes 

(mnx\ (nny\ 
52=5ocos(-^Jcos(-^J . (9.35) 

Each pair im,n) forms a particular solution of the field in a waveguide, and these are called 
the TEto„ modes. By convention, the first index refers to the largest side of the rectangle 
(here a). 

The wavenumber of the waves travelling inside the waveguide are found by substitut- 
ing Eq. (9.34) into Eq. (9.32): 
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If the frequency of the wave is smaller than a certain threshold, the wave number k be- 
comes imaginary {k = ix), and the solution does not propagate throughout the waveguide: 

B = Bo e"''^ e-^'"^ (9.37) 

This happens at the cut-off frequencies 



^mn = C7l]^ —^ + -^. (9.38) 

The lowest cut-off frequency is a>io = cn/a. No frequencies smaller than this value can 
propagate through the waveguide. The wave number for the TEmn mode can then be 
written as 

1 

c 

and the phase velocity Vp is calculated as 



k = -\/a)^-ojlri, (9.39) 



(i) c 

Vp = - = ^= - . (9.40) 

This is greater than c, since w > for propagating waves! Fortunately, the group veloc- 
ity Vg is well-behaved: 



cyi-(ci)niM^<c. (9.41) 
This is the velocity of the energy in the wave. 

Exercise 9.3: Calculate the surface charges and currents from Eq. (9.35). 
The coaxial transmission line 

As a second example, consider a coaxial cable. The first thing to note is that such wave- 
guides do admit TEM waves, since the core of the waveguide can be held at a different 
potential from the outer wall. In this case, the two Maxwell's equations in Eq. (9.26) lead 
to the ordinary wave number 

(1) 

k = -, (9.42) 

c 

so the waves behave just as free-space waves: there is no dispersion and they travel at the 
speed of light in the medium between the core and the outer wall (c in the case of vacuum). 
Specifically, the coaxial cable does not act as a filter with some cut-off frequency, and is 
therefore particularly suited for the transmission of a wide variety of wave signals. 

The fields inside the coaxial cable obey the standard relations cB^ = -Ey and cBy = E^, 
and Maxwell's equations become 

dE^ dEy dEy dE^ 

+ — ^ = and —^-—^ = 0. (9.43) 



dx dy dx dy 
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Figure 34: The coaxial transmission cable. 
In cylindrical coordinates (p,(f),z) this leads to the fields 

Eq 

'Eip,(p,z,t) = — cos(^2 - cot) p, 

p 

Eq 

Bip,(h,z,t)= — cosik z- cot) (f). (9.44) 
cp 

A coaxial cable has surface currents and charges in the core and the wall, so it will 
have both a capacitance C and inductance L. These can be calculated as 



C = , ^ ^ and L = 7^ In ^ ^ 
ln(&/a) 2n 



a 



(9.45) 



We can use these to define the impedance Z as 




(9.46) 



If the coaxial cable is plugged into a receiver that has a different impedance, part of the 
signal will be reflected back into the cable. This is called impedance mismatch. The units 
of impedance are Ohms (Q). 

9.4 Resonant cavities 

Imagine that we again have the rectangular waveguide with width a and height b, but 
this time we close off the waveguide with a perfectly conducting wall at 2 = and z = d. 
What are the TE modes? 

To answer this question, we note that we can use the solutions for the rectangular 
waveguide from the previous section, with the TE^„ mode 



B^=B 



ocos(^)cos(^)e". (9.47) 



We now have to impose additional boundary conditions in the 2 -direction, noting that 
Bx = at the boundary. This means that the magnetic field becomes 

B2^fi^=fiocos(-^Jcos(-^Jsin(^J e (9.48) 
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and the cavity modes are standing waves with frequencies 




^rnnp^Cn^'-^ + -^ + -^. (9.49) 

These results are valid for perfectly conducting walls that do not allow fields to penetrate. 

In real cavities, the walls do not have perfect conductivity, and fields will propagate 
into the walls. The amplitude of the fields drops off with a length scale given by the skin 
depth, and the fields can leak out of the cavity into the walls (and possibly beyond). One 
of the most important characteristics of a cavity is therefore how well the cavity retains 
the fields inside. Since the fields carry energy, one way to define the quality of a cavity is 
to set up an energy balance. For a wave with frequency mq, the quality factor, or Q factor 
of the cavity is given by 

EM energy in the cavity 

Q^(^o—=, — . (9.50) 

Energy loss per cycle 

Since the energy loss per cycle is just the negative time derivative of the energy we can 
write 

where U is the energy in the cavity. We can solve this straight away: 

Uit) = Uo e"'""*^^ , (9.52) 

which is an exponential damping of the waves int he cavity. The higher the value of Q , the 
smaller the damping. The electric field in the cavity will then have the form 

E(^) = Eo e-'^oi/Qg-^(t^o+Aw)f ^ (9 53) 

where we included a shift in the frequency Aco that typically accompanies damping. We 
want to look at the spectrum of the electric field E(tL>), so we take the Fourier transform of 
E(^): 

Eiw)=— e-'^ot/Q^-i{m+^<^)t^f_ (9 54) 

The electrical energy density in the cavity is equal to ^eo|E((w)|^, which can now be calcu- 
lated as 

1 9 colEol^ 1 

-eolE w)|^ = ^^^-^^ ^ ^, 9.55) 

2 " 4n {CO - coo -Aa))^ + {0)0 2Q)^ 



This spectrum is shown in Fig. (35). Narrower shapes indicate better cavities. You can 
determine Q from the plot by dividing coo with the full-width at half maximum T. 
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Figure 35: The spectrum of a cavity with finite Q factor 



The higher the Q , the longer the field remains in the cavity, and the stronger the inter- 
action betvi^een the field and an atom, ion or quantum dot placed in the cavity. Values of 
Q = 10^ have been achieved in microvi^ave cavities v^^ith superconducting vi^alls. These cav- 
ities are good enough to make single photons interact very strongly with Rydberg atoms. 
In optical cavities, values on the order of Q = 10^ to 10* can be realistically achieved. 

The Q factor depends on the shape of the cavity, the permeability of the interior cav- 
ity fj-c and the vi^alls [j., and the skin depth 6. The Q factor is proportional to the relative 
strength of the permeabilities ;u//ic. From Eq. (9.50) the energy stored in the cavity is 
proportional to the volume, vi^hile the energy loss per cycle is proportional to the surface 
(though which the energy must flow) multiplied by the skin depth. The Q factor is there- 
fore also proportional to the ratio V/S6. Finally, there is a form factor f that depends on 
the specific geometry of the cavity: 

Li V 

jUc So 

Smaller skin depth (higher conductivity) and maximised volume to surface ratio (i.e., more 
spherical cavities) will help obtain a higher Q. 

An interesting application of cavity theory is the Earth's ionosphere. The ionosphere 
is a plasma, and consequently for low enough frequencies (long wavelengths) it acts as a 
mirror. Since the earth is also a conductor, low frequency radio waves (AM) can bounce 
between the ionosphere and the Earth to reach much further than high frequency (FM) 
radiowaves. The plasma frequency of the ionosphere is about 3 MHz. 
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10 Relativistic formulation of electrodynamics 

One of the remarkable things about the Maxwell equations is that they give rise to a 
correct description of all sorts of radiation, most notably light. In particular, it predicts 
the correct velocity of light in vacuum via the relation (? = iiJ,oeo)~^, as we have seen in 
Section 5. Experimentally it was found by Michelson and Morley in 1887 that the velocity 
of light is independent of the velocity of the source. This caused Lorentz quite a headache, 
which ultimately resulted in the Lorentz transformations for moving bodies: Objects that 
move with respect to an inertial observer are seen to experience a length contraction in 
the direction of their motion, and their clocks seem to run slower. 

There are similar rules for the electromagnetic field, meaning that what one inertial 
observer calls the E field, a second observer may describe in terms of both E and B fields. 
We therefore want to know two things: (1) How do the fields transform from one coordinate 
system to another, and (2) What do Maxwell's equations look like when written in covari- 
ant form, that is, independent of the coordinate system? We will first revise the basics of 
special relativity and Minkowski space, and then we will construct Maxwell's equations 
from the vector and scalar potentials, and the fields. This leads naturally to the trans- 
formation laws for the E and B fields. We end with a look at some invariant properties. 
This section will rely heavily on the transformation properties of vectors and tensors, and 
everything will be in index notation. 



10.1 Four-vectors and transformations in IVIinkowski space 

Remember that covariant descriptions take place in four-dimensional Minkowski space. 
There is the position four-vector x^, where = 0, 1,2,3, r = (x^,x^,x^) or r = xi+yj+zk, and 
x° = ct, and the momentum four-vector with p = ip^,p^,p^) and = U/c. Minkowski 
space is a strange place in that it has a non-trivial metric: Ordinarily, if you want to 
find the length of a (short) interval, you add all the components squared, according to 
Pythagoras: 

ds^ = dx^ + dy^ + dz^. (10.1) 

However, in relativity the length changes in different reference frames, and we need to 
include the change in the time coordinate as well. It is tempting to just add c^dt^ to 
Eq. (10.1), but that would be wrong! The correct distance between two events is 



ds' 



-c^dr + dx^ + dy^ + dz^ 



(10.2) 



with a minus sign in front of c'^dt^\ This is the definition of the distance in Minkowski 
space, and we can rewrite this as a vector equation (and also using Einstein's summation 
convention): 



ds ={ct,x,y,z) 



-10 
10 
10 
1 





'ct 




X 




y 


J 





(10.3) 
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where the four-vector can be written as (ct, x, y, z). Notice that the matrix g^y has a "- 1" 
on the diagonal, which is responsible for the minus sign in the metric. You also see that 
there is a difference between upper and lower indices. The four-vectors with upper indices 
are called contravariant, and the four-vectors with lower indices are called covariant. You 
can verify from Eq. (10.3) that the covariant four-vector has a relative minus sign in the 
first component: 

x^-i-ct,x,y,z). (10.4) 

We are cheating a bit here, because the x^ on the left is a component, while the quantity 
on the right is a proper vector. However, this expression is to emphasize the effect of the 
vertical position of the index; it is not a proper equation. As you can see from Eq. (10.3), 
the metric raises or lowers the index, and g^v = g^^ in the case of special relativity. 

We now amend the Einstein summation convention: the sum is implied over two re- 
peating indices, one of which is upper, and the other is lower. Carrying out the sum is 
called contraction. We can construct invariant quantities (scalars) by contracting con- 
travariant four-vectors with covariant four-vectors. In general we write the components of 
a four-vector as {aP ,a^), and we have: 

a^h^ = a^g^,y = Y^a^'g^,b'' 

/i,V 

3 

= -a%'^+Y,a^bJ ^-a^b^ + a-h, (10.5) 

where greek indices sum over all four components, while roman indices sum only over the 
spatial part^. 

As an example, let's look at the four-momentum of a particle: p = (U/c,p^,p^,p^), and 
p^ = p^p^ can be written as p^ = -U^/c^ + Ipl^- Since this is true for all inertial frames, it 
is true for the frame where the particle is at rest, so |p|^ = 0. Using U = mc^ with m the 
rest mass of the particle, we find p^ = -m^c^ (yes, in Minkowski space a square can be 
negative without complex numbers!), and the energy of a particle is therefore 

[/= y'm2c4 + |p|2c2. (10.6) 

This is one of the most important formulas in physics. 

Any four-vector is transformed into due to a Lorentz transformation A'^ ^ via 

a^' = A^'^a^ = ^A^>^. (10.7) 

Note that the primed frame of reference is indicated by a primed index fi'. The Lorentz 
transformation involves a contraction over the old (unprimed) coordinates in order to re- 
move them from the equation. In general, a tensor transforms as 

T^'^' = A^'^A^'vT^\ (10.8) 

^You should be aware of the fact that the metric gf^y can also be written as a diagonal matrix with three 
-1 entries for the spatial part and one +1 for the temporal part. This does not lead to observable differences 
in the theory, so it is a convention. Unfortunately, both conventions are used regularly, so make sure you 
know what the metric is before you copy a relativistic formula from a book! 
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Every component is transformed with a separate Lorentz transformation A*^ ^. The actual 
form of the Lorentz transformation, for example for a boost in the z direction, is 



r 








-7)6 





1 














1 





7)6 








r 



(10.9) 



with 7 = 1/ - )6^ and )8 = u/c. In component notation the Lorentz transformation = 



A^^ ^x^ becomes 



x^' = yx^ - rl5x^ 

2' 2 

x^' = -yPx^ + yx^ = 



Vl - v^/c 



1 ( vz\ 
let 



-.(z-vt). 



Note that the Lorentz transformation is a proper coordinate transformation: 



(10.10) 



(10.11) 



Using these rules it is straghtforward (but somewhat lengthy) to find the transformation 
rule for any tensor. 



10.2 Covariant Maxwell equations 

Just like the position and momentum four-vectors, we would like to construct electric and 
magnetic four- vectors. Unfortunately, it is not that simple: There are six field components 
(three for E and three for B), and we cannot force them into four-vectors. On the other 
hand, we can combine the vector potential and the scalar potential into a fourvector A^: 



A^(x) = 




(10.12) 



where x is again the position four-vector. The second four-vector that we can construct 
straight away is the current density j^^: 

fix)=[cpix),Jix)]. (10.13) 

Since the Maxwell equations are differential equations, we also need to construct a 
four- vector out of the differential operators. Again, they come in two variations, covariant 
dn and contravariant d^: 



Qti- ^ -L^^ vl and a - ^ -f"*^^ vl 



(10.14) 
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These operators transform just like ordinary vectors (but note the vertical position of the 
indices). We can construct an invariant with these operators if we contract them: 

dud^ = dl + dl + dl-\d^ = a. (10.15) 
c 

This is the d'Alembertian, which we first encountered in Section 3. 

The continuity equation (conservation of charge) then becomes particularly compact: 

a^j^ = o. (10.16) 

Compare this with Eq. (2.21); the two equations say exactly the same thing! The Maxwell 
equations in Eqs. (3.12) can be written as 



(10.17) 



Often people say that this encompasses all of electrodynamics. However, you should re- 
member that we also need to know how charges respond to the fields, that is, we need to 
know the Lorentz force. Since this force depends explicitly on the velocity of the particle, 
it is not in covariant form. We will consider the covariant Lorentz force later. We also 
see explicitly that the Maxwell equations are already relativistically correct: We did not 
change anything in Eqs. (3.12) and (3.13), we just rewrote everything. The compactness 
of Eq. (10.17) indicates that Lorentz invariance is a symmetry of electrodynamics (The 
more symmetric the object, the more compact we can make its description: For example, 
compare the description of a perfect sphere with the description of a sponge). Lorentz 
invariance is one of the most important symmetries in Nature. 

Maxwell thought that the electric fields were carried by some substance that pervades 
all space, called the aether. Such a substance defines a natural reference frame, which 
would break Lorentz invariance. It was Einstein who disposed of the aether and made 
all inertial reference frames equivalent. Thus he uncovered that electrodjmamics was a 
relativistic theory all along. In fact, his famous paper of 1905 on special relativity was 
titled "On the electrodynamics of moving bodies". 

In addition to the Maxwell equations in Eq. (10.17), we need to specify a particular 
gauge for the four- vector potential A'^ if we actually want to calculate anj^thing. The 
relativistically invariant gauge is the Lorenz^ gauge 

^^A^' = 0. (10.18) 

Now that we have constructed the four-vector potential for the electromagnetic field, 
we can ask what the E and B fields look like in a relativistic setting. We can use the 
relations 

E = -V(I)-— and B = VxA. (10.19) 

dt 



'^This is not a typo: Lorenz and Lorentz were different guys. 
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If we write out a few components we get, for example, 

dx 



dA, 



dt 



-id^A^-d^A'^). 



(10.20) 



dy dz 

Obviously we can only fit all six components of the electric and magnetic field into a rel- 
ativistic object if we have something with two indices. We call this the (anti-symmetric) 
field-strength tensor F: 

( 

-B^ B, 
B^ 





-EJc 



-Ey/c 
-EJc -B 



EJc Eyic EJc\ 

-Bz Uy 

-Bx 




y 



B, 



(10.21) 



The relativistic transformation of the fields then becomes 



For a boost v in the z direction given by Eq. (10.9) we find 

E'^ = Y[Ex + vBy), E'y = r[Ey-vBx), 

B', = r {bx - ^Ey] , b; = r (By + ^e,] , 



E'=E, 



B',=B, 



(10.22) 

(10.23) 
(10.24) 



In terms of the parallel and perpendicular components of the fields with respect to the 
direction of motion this becomes 



e;,=e 

Bii = Bii 



II 



E^ = 7E_L + 7V X B 
. _ vxE 

B^ = rBx-r^-. 



(10.25) 



A physical situation with only a static charge distribution and no currents obviously 
does not involve B fields. The magnetic fields arise when the charge distribution is viewed 
by a moving observer. In particular, the electrostatic force F = gE aquires a term propor- 
tional to V X B, so that F' = gE + gv x B (verify this!). This is of course the familiar Lorentz 
force. In other words, when an observer sees a moving electric charge, the E field is not 
spherically symmetric due to Lorentz contraction in the direction of motion. As a result, 
there is a component of the electric force that acts only on moving charges. This is the 
magnetic field. This argument shows that you can think of the magnetic field as just the 
relativistic part of the electric field of moving charges. Whenever there are B fields, there 
must be moving charges, whether it be macroscopic currents or spinning charges in atoms. 
The field lines of a stationary and moving charge are shown in Fig. 36. 

Finally the Lorentz force must be formulated in a covariant way. The force experienced 
by a particle with charge g, as viewed from the lab frame, is proportional to the velocity, 
so we need to define a four-velocity u. We can use the relation 



(10.26) 
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Figure 36: The field lines of an electric charge, (a) The charge is stationary, (b) The charge is 
moving with velocity v. 

such that u^u^ = -c^ (verify this!). Rather than deriving the relativistic Lorentz force, 
vi^e w^ill state it here and confirm that it behaves the vi^ay it should. Suppose that t is 
the proper time of a particle, as recorded by a co-moving clock. The force will cause a 
momentum transfer dp^ldr. This must be proportional to E + v x B. The cross product is 
built in the field strength tensor F^"^ , so we guess that the covariant form of the Lorentz 
force is 

= ^ = ^^vF^' = qF^^^ . (10.27) 

These are the equations of motion for a relativistic particle in an electromagnetic field. 
Let's see what the different components of f look like: 



fy 



dr 
dp^ 

di 
dp^ 

di 



qF^^Uy 


= yq 


( 

c — 




I c 


qF^^u, 


= rq 


f Ey 

c — 




I c 


qF^^Uy 


= rq 


f E, 
c — 



+ Vx-0+VyBz-VzBy 



VxBz + Vy-O + VzB, 



+ v^By-VyB^-Vz-0\ (10.28) 



These are indeed the three components of the Lorentz force, with an extra factor j. This 
is correct, because the term dp^ldi also has an implicit y factor in the relativistic mass. 
Remains the question of what is ft: 

eft = c-^ = qcF^^Uy = jq [v^-E^ + VyEy + v,E,) = yg v E. (10.29) 

This is the relativistic work done on the particle by the fields. So you see that it all works 
out. 

10.3 Invariant quantities 

When the fields and potentials change when viewed in different inertial frames, it is im- 
portant to know what the invariant quantities of a theory are. These quantities are the 
same in all reference frames, so you can evaluate them in whichever frame makes the cal- 
culation easiest. There are many invariants, most notably the magnitude of four-vectors 
a^ - a^a^. You can use the differential operator as well in the construction of invariants 
(see Eq. (10.13) for an important example). A quantity is invariant when it does not de- 
pend on the coordinates, so in index notation the quantity has no indices. 
The quantity ^F^^F^^ is invariant (why?), and can be evaluated as 

\F,yF^^ = Igp.g.yFP'^F^^ = T " £ • ^^^-^^^ 
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This is the Lagrangian (density) of the electromagnetic field, which plays a pivotal role in 
particle physics. 

We can also construct the dual of the field-strength tensor 



pa 



'0 


-Bx 


—By 


-bA 


Bx 





EJc 


-EyIC 


By 


-E,/c 





EJc 




Ey/C 


-EJc 


J 



(10.31) 



which is a pseudo-tensor. Here we used the Levi-Civita (pseudo) tensor e^'^f"^ , which re- 
turns 1 if the indices make an even permutation, -1 if they make an odd permutation, 
and otherwise. The invariant product F^y^^^ is 



(10.32) 



Finally, the microscopic Maxwell equations can be written in terms of the fields as 



d^F^'^ = Hof and d^^^^ = 



(10.33) 



These equations plus the Lorentz force encompass all of electrodynamics. 
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A Special coordinates 

Often it is much more convenient to use cylindrical or spherical coordinates, rather than 
cartesian coordinates. The differential operators change accordingly. For cartesian coordi- 
nates: 



AxB 

V/- 

V-A 

Vx A 



(AyB^ -A^By)x + iAzBx-A^Bz)y + (A^By -AyBx)z 

df df df 

— x^ j)H z 

ox dy dz 

dAr dAy dA. 



dx 
idA, 



+ 



dy 
dA 



+ 



dz 



dy dz 



y \ 



dAr dA, 



. dz 



dx 



y + 



(dAy dA, 



, dx dy 



(A.1) 
(A.2) 

(A.3) 

(A.4) 



z 




Figure 37: Cartesian coordinates (x,y,z). 



Cylindrical coordinates: 



V/- 

V-A 

VxA 



df Id/, df 

— p+ -—<p+ —z 
dp p d(p dz 

1 dipAp) 1 dA^ dA^ 
p dp P d(P dz 



1 1 dA, dA 



p d(p dz 



P + 



IdAo dA, 



1 d ( df 
P 



Pdp 



+ 



dz dp 
^d^f 



dp I d(p^ dz^ 



. 1 IdipA^) 
0+- 



dA, 



dp d(p 



(A.5) 
(A.6) 
(A.7) 
(A.8) 
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Figure 38: Cylindrical coordinates (p,(p,z). 



Spherical coordinates: 

V/- 

V-A 



VxA 



df Id/. 1 df . 

—r+ — -d+ -(h 

dr r dd rsind d(p 



1 dir'^Ar) 



+ ■ 



1 diAgsind) 



dA 



dr rsind dd 
tdiA^sinO) dAg 



+ 



4> 



rsind d(p 



rsind 
+ - 
1 d I 

J.2 Qj. 



dd dip 
1 dAr d{rA^)\ 



sin0 d(p 
2df\ 1 



dr 
d ' 



9 + 



d(rAg) dAr 



dr 



+ 



dr j r^sinddd 



1 



de 

d^f 







smt^ — + 

dd) r2sin2 0d02 



(A.9) 
(A. 10) 
(All) 
(A 12) 
(A 13) 



z 




Figure 39: Spherical coordinates (r,(j),9). 
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B Vector identities 

Having defined the divergence and the curl of vector fields, we can state the important 
integral theorems (without proof): 



f V-AdV={ A-dS 
Jv Js 

( (VxA)-dS=f A-dl 
Js Jc 



(Gauss' theorem) 
(Stokes' theorem) 



(B.l) 

(B.2) 
(B.3) 



and some important vector identities, such as relations between 3-vectors: 





A-(BxC) = B-(CxA) = C-(AxB) 


(B.4) 


Ax(BxC) = (A-C)B-(A-B)C 


(B.5) 


(AxB)-(CxD) = (A-C)(B-D)-(A-D)(B-C) 


(B.6) 


first derivatives: 






V-icpA) 


= (/)(V-A) + A-(V(/)) 


(B.7) 


V X ((pA) 


= (/)(VxA) + (V0)xA 


(B.8) 


V(A-B) 


= Ax(VxB) + Bx(VxA) + (A-V)B + (B-V)A 


(B.9) 


V-(AxB) 


= B-(VxA)-A-(VxB) 


(B.IO) 


Vx(AxB) 


= (B-V)A-(A-V)B+A(V-B)-B(V-A) 


(B.ll) 


and second derivatives: 








V-(VxA) = 


(B.12) 




V X (V(p) = 


(B.13) 




Vx(VxA) = V(V-A)-V^A 


(B.14) 


Kronecker deltas and Levi-Civita symbols: 






£ijk£klm = Sil6im,-Sii6im- 


(B.15) 
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C Units in eiectrodynamics 

Traditionally, electrodynamics is plagued by the use of many different sets of units. In 
these lecture notes we use SI units (Systeme International) exclusively. However, many 
people also use Gaussian units, so it is worthwhile to spend some time exploring the dif- 
ference betwen these units. In particular, you should be able to read the set of units used 
off the Maxwell equations. The rule of thumb is: if they involve factors of 4n, then they 
most likely use Gaussian units. 

Carl Friedrich Gauss was a German mathematician who in the 1830s did (among very 
many other things) worldwide measurements of the Earth's magnetic field. Not only are 
the Gaussian units named after him, the unit of magnetic induction in Gaussian units is 
also called the Gauss (G). In SI units the unit of magnetic induction is the Tesla (T), after 
the Serbian inventor Nikola Tesla (1853-1943). 

There is a certain arbitrariness in Maxwell's equations, the Lorentz force, and the 
continuity equation, in the sense that we can add constants ki, k2, ...to the equations 
that affect only the units, and not the physics: 

dp 

kiV-J+-^ = and F = yfe2pE + /feaJ x B (C.l) 

V-E = ^4p and V-B = (C.2) 

dB dB 

VxE + ^5— = and VxB = ^6J + ^7^ (C.3) 
dt dt 

Not all the constants to ^7 are independent. Some manipulation of the above equations 
will give the identity 

/feeV-J + M?— = (C.4) 

dt 

so we have the restriction that ke = kik^kq. Similarly, by rederiving the wave equations 
we find that k^kq = c~^. 

More confusion arises when we consider macroscopic media. Gauss' law in terms of the 
polarization field becomes 

V-(E + ^4P) = ^4Pfree. (C.5) 

which defines the displacement field 

D = ^8(E + /fe4P). (C.6) 

Similarly, with a bound current density Jbound = V x M -I- P the Maxwell-Ampere law be- 
comes 

d 

V X (B - k^M) = keJfree + ^7 — (E + ^4?) . (C.7) 

dt 

The magnetic field is then defined as 

H = ^9(B-^6M). (C.8) 



The different constants ^1 ... ^9 in both SI and Gaussian units are given by 
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k\ k2 ks k4 Uq kg kg 


SI units 
Gaussian units 


1 1 1 1 luo c"^ eo Mo^ 
1 1 47r 47rc-i c'^ 1 1 



As you know, in the SI system the fundamental units are meter (m), kilogram (kg), 
second (s), and ampere (A). The unit offeree is the Newton (N), which is of course kg m/s^. 
Just like the second and the kilogram, the ampere is defined operationally. One ampere is 
the amount of current needed to create a force of 2 • 10~^ N per meter length between two 
infinitely long parallel wires separated by a distance of one meter. 

In the Gaussian system, the fundamental units of mass, length, and time are the gram 
(g), centimeter (cm), and again the second (s). Instead of the ampere, the other fundamen- 
tal unit is the statcoulomb (statC) for the unit of charge. The statvolt (statV) is the unit of 
potential, and obeys 1 statV = Ig cm/statC s^. 





charge 


current 


E 


B 


J"0 




SI 


C = As 


A 


N/C = V/m 


T = N/Am 


Tm/A 


C^/Nm^ 


Gauss 


state 


statA = statC/s 


statV/cm 


G(statV/cm) 







In Gaussian units the permittivity and the permeability are dimensionless. 



